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Preface 


The basic sciences and physics, in particular, are of prime im- 
portance today in the training of engineers for the various 
branches of the national economy. This has led to fundamental 
changes in recent years in the teaching of physics in engineer- 
ing institutes, and to the students of other educational insti- 
tutions in which they do not major in physics. The scope and 
scientilic level of physics courses have been substantially sup- 
plemented and cover the main trends in the development of 
modern physics. Consequently, the physics textbooks for en- 
gineering students have inevitably become three-volume edi- 
tions of almost fifteen hundred pages. The need has arisen, 
in this connection, for a concise handbook on this subject. 
The aim of the authors was to fulfill this need. 

In scope and depth this handbook includes all of the defini- 
tions, formulas and information covered in the most comprehen- 
sive and up-to-date physics courses of engineering institutes 
and the physics departments of universities and colleges. Physi- 
cal laws are concisely formulated, all the necessary explana- 
Lions are given and, in many cases, derivations as well. Though it 
plays a vital role in the teaching of physics, experimental ma- 
terial has been omitted. This is due only to a lack of space. 
All the units and symbols comply with the requirements of the 
ST. Units of physical quantities and systems of units are listed 
and dealt with in a short appendix. 

This handbook is designed primarily for engineering students, 
as well as college and university students studying, but not 
majoring in physics. It can be used to advantage by engineers 
and graduate students, as well as by instructors and lecturers of 
intermediate schools and colleges. 
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Mathematical knowledge required in using the handbook is 
within the scope of the ordinary mathematics courses of engi- 
neering institutes. The detailed index and the numerous cross 
references, indicating the chapter, section and subsection, are 
of aid in finding any required information. The authors are 


sincerely grateful to the translator, N. Weinstein, and the. 


science editor, T. Baranovskaya, for their comprehensive and 
expert work in the publication of the English edition of this 
handbook, as well as for their valuable suggestions, which should 
facilitate its use by the reader. 


B. Yavorsky 
A. Detlaf 


Parts One, Four, Five and the appendices were written by 
Prof. A. A. Detlaf; Parts Two, Three, Six, Seven and Eight, 
by Prof. B. M. Yavorsky. 
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PART ONE MECHANICS 


CHAPTER 1 KINEMATICS 


1.1 Mechanical Motion. The Subject Matter 


of Mechanics 

i 
1.4.14 The simplest form of motion in nature is mechanical 
motion. It consists of changes in the relative positions of bodies 
or their parts in space in the course of time. The branch of phys- 
ics dealing with the laws of mechanical motion is called me- 
chanics. In a more narrow sense of the word, this is often under- 
stood to mean classical mechanics, which deals with the motion 
of macroscopic bodies travelling at velocities a great many 
times less than that of light in free space. Classical mechanics 
is based on Newton’s laws and is therefore frequently referred 
to as Newtonian mechanics. The laws of motion of bodies at 
velocities close to that of light in free space are taken up in 
relativistic mechanics (5.1.1), and the laws of motion of micro- 
particles (for example, the electrons in atoms, molecules, crys- 
tals, etc.) are the subject matter of quantum mechanics (38.1.1). 
1.1.2 Classical mechanics comprises three basic branches: sta- 
tics, kinematics and dynamics. Statics deals with the laws for 
the composition of forces and the conditions for the equilibrium 
of bodies. Kinematics pees a mathematical description of all 
possible kinds of mechanical motion, regardless of what causes 
cach particular kind of motion. Dynamics investigates the effect 
of the interaction between bodies on their mechanical motion. 
1.1.3 The mechanical properties of bodies are determined by 
their chemical nature, internal structure and state. These are 
beyond the scope of mechanics and are studied in other 
branches of physics. Hence, in accordance with the conditions of 
vach specific problem on hand, various simplified models are 
employed to describe real bodies in motion. These include the 
particle, the perfectly rigid body, the perfectly elastic body, 
the perfectly inelastic body and others. 
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A material poini, or particle, is a body whose shape and size 
are of no consequence under the conditions of the problem 
being considered. For example, in a first approximation, the 
motion of a ship from one point to another can be dealt with as 
the motion of a particle. 1f, however, it becomes necessary to 
take into account such details of this motion as the pitching: 
and rolling of the ship by swells or heavy seas, the ship should 
be dealt with as an extended body of definite shape. In the liter- 
ature the word particle is more frequently used than mate- 
rial point. 

Any extended body or system of such bodies, making up the 
mechanical system being investigated, can be regarded as a 
system of particles. For this purpose, all the bodies of the system 
can be conceived of as being broken down into such a large 
number of parts that the size of each part is negligibly small 
compared to the sizes of the bodies themselves. 

1.1.4 A perfectly rigid body, or simply rigid body, is one whose 
deformation can be neglected under the conditions of the given 
problein. The distance between any two points ol a rigid body 
remains constant no matter what action the body is subjected 
to. A perfectly rigid body can be regarded as a system of par- 
ticles rigidly connected to one another. 

A perfectly elastic body is one whose deformation obeys Hooke’s 
law (40.3.4). After the external forces applied to such a body 
have been removed, its initial size and shape are completely 
restored. 

A perfectly inelastic body is one that completely retains its 
deformed state, caused by external forces applied to the body, 
when these forces have been removed. 


1.2 Frames of Reference. Path, Path Length 
and Displacement Vector of a Particle 


1.2.1 The position of a body in space can be specified only in 
reference to other bodies. It makes sense, for instance, to speak 
of the location of the planets with respect to the sun, that of 
an air liner or steamship with respect to the earth, but we 
cannot indicate their position in space “in general” without ref- 
erence to some specitic body. A perfectly rigid body in which 
a coordinate system is rigidly fixed, furnished with a “clock” 
and employed to determine the position of bodies and particles 
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being investigated in space at various instants of time, is called 
a frame of reference. Sometimes the frame of reference is the 
coordinate system itself if it is provided with a “clock” and 
the solid in which the system is rigidly fixed is called the body 
of reference. In each specific problem the frame of refer- 
ence is selected so as to simplify the problem to the maximum 
extent. Inertial frames of reference (2.1.2) are the most com- 
monly used in physics. 

1.2.2 Most extensively employed is the Cartesian rectangular 
system of coordinates (Fig. 1.4) whose orthonormal (normal or- 
thogonal) basis is formed by the 
three vectors i, j and k, which 
are unit vectors with respect 
to their magnitudes, mutually 
orthogonal and pass through 
the origin O of coordinates. The 
position of an arbitrary point 
M is specified by the radius 
vector r, which connects the ori- 
gin O of coordinates to point M. 
Vector r can be resolved along 
the axes i, j and k of the basis 
as follows: 


r = vi + yj + 2k, Fig. 1.4. 


where zi, yj and zk are the com- 

ponents of vector r along the 

coordinate axes. The coefficients of resolution, z, y and z, are 
the Cartesian coordinates of point M and, due to the orthogo- 
nality of the vectors of the basis, the projections of radius 
vector r on the corresponding coordinate axes. 

The motion of a particle is completely specified if three con- 
linuous and single-valued functions of the time ż, 


v = z (t), y = y (t) and z = z (t), 


are given to describe the changes in the coordinates of the 
particle with time. 

These equations are said to be the kinematic equations of motion 
of a particle. They are equivalent to the single vector equation 
r =r (ż)}, for a moving particle. | 
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1.2.3 The line described by a moving particle in space is called 
its path. The kinematic equations of motion of a particle spec- 
ify the equation of its path in parametric form (the parameter 
being the time ż). Depending upon the shape of the path, dis- 
tinction is made between rectilinear and curvilinear motion. 
The motion of a particle is said to be plane if its path lies com- 
pletely in a single plane. 
The mechanical motion of a body is relative, i.e. the nature of 
the motion and, in particular, the kind of path travelled by 
the points of the body depend upon the chosen frame of re- 
ference. 
1.2.4 In the general case, the path of a moving particle is a 
spatial, rather than plane, curve. Introduced for such curves is 
the concept of an osculating 
plane. The osculating plane at 
the arbitrary point M of a 
curve isin the limiting position 
fa plane passing through any 
three points of the curve when 
these points approach point M 
without limit. 
An osculating circle at point M 
of a curve is the limit ofa circle 
passing through three points 
of the curve being considered 
when these points approach po- 


x int M without limit. The oscu- 
i lating circle lies in the oscula- 
Fig. 1.2 ting plane. The centre of the oscu- 


lating circle and its radius are 

called the centre of curvature and 
radius of curvature, at point M, of the curve being considered. 
The straight line connecting point M to the centre of curvature 
is called the principal normal to the curve at point M. The tan- 
gent to the curve at point M is perpendicular to the principal 
normal and also lies in the osculating plane. 
1.2.5 The path length is the sum of the lengths of all the por- 
tions of the path over which the particle passes in the given 
interval of time. The instant of time t = tẹ, from which we 
begin to consider the motion of a particle, is called the initial 
instant of time, and the position of the particle at this instant 
(point A in Fig. 1.2) is called its initial position. Owing to the 
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arbitrary nature of the time reference point it is usually as- 
sumed that ¢, = 0. The path length s, travelled by the particle 
from its initial position, is a scalar function of time: s = s (t). 
As is obvious from its definition, the path length of a particle, 
cannot be a negative quantity. If the particle travels along an 
arc of the path AB (Fig. 1.2), always in the same direction and 


E 
is at point M at the instant of time ż, then s (t) = AM. If the 
particle travels along the path in a more complicated way, for 
example if it travels from A to B, reaching B at the instant of 
time ¢; < ¢, and then reverses, returning to point M at instant t, 


yon 
then s (t) =B + BM. | oe 
1.2.6 The displacement vector of a moving particle during the 
time interval from t = t, to t = t is the vector from the, posi- 
tion of the particle at the instant of time t; to its position at 
instant #,. It is equal to the increment in the radius vector of 
the particle during the time interval being considered: 


ra — Ty =T (ta) — T (t). 


The displacement vector is always directed along the chord 
subtending the corresponding portion of the path. 

Shown in Fig. 1.2 is the displacement vector of the particle for 
the time interval from ¢, to ¢. It is equal to r — rọ = r (t) — 
— r (t). eee 
The displacement vector of a moving particle for the time inter-- 
val from ¢ to t+ At equals | 


Ar =r (t + At) — r (t) = Ax X i+ Ay X j+ Az X k, 


where Az, Ay and Az are the increments (changes) in the coordi- 
nates of the particle during the time interval being considered. 
1.2.7 A particle travelling freely in space can accomplish only 
three independent motions, i.e. motions that cannot be repre- 
sented as combinations of the other two. As a matter of fact, 
the motion of a particle along each of the axes of a Cartesian 
rectangular coordinate system cannot be obtained by its motion 
along the other two axes. The number of independent motions 
that a mechanical system can accomplish is called the number 
of degrees of freedom of the system. Hence, a free particle has 
three degrees of freedom. 
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1.3 Velocity 


1.3.14 The concept of velocity is introduced in mechanics to 
characterize the rate of motion of bodies. It is the time rate of 
change in the position of moving bodies. The average velocity of 
a moving particle in the time interval from ¢ to t + At is the 
vector Vay, equal to the ratio of the increment Ar in the radius 
vector of the particle during this time interval to the length of 
the time interval At: 


Ar 
vav="Are 


Vector Vay has the same direction as Ar, i.e. along the chord 
subtending the corresponding portion of the particle’s path. 
1.3.2 The velocity (or instantaneous velocity) of a moving particle 
is the vector quantity v, equal to the first time derivative of 
the radius vector r of the particle being considered: 


_ a 
dt ° 

The velocity of the particle at the instant of time ¢ is the lim- 
iting value that the average velocity vay approaches for an 
infinitely small value of the time interval At. Thus 


: Ar ; 
v=lim ——= lim Vav; 
At=0 At Mto 


The velocity vector v of the particle has the direction along a 
tangent to the path toward the motion in the same way as the 
vector dr = v dt of infinitely small displacement of the particle 
during an infinitely short time interval dt. 
The path length ds travelled by the particle during the length of 
time dt is equal to the magnitude of the displacement vector: 
ds = | dr |}. Therefore the magnitude of the velocity vector of a 
moving particle is equal to the first time derivative of the path 
length: 

ds 
p=\|v]= 


dt ° 
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1.3.3 The resolution of vector v along the basis of a Cartesian 
rectangular coordinate system is of the following form: 


v= Vyit-vyj+v_k. 


‘he projections of the velocity of a moving particle on the coor- 
dinate axes are equal to the first time derivative of the corres- 
ponding coordinates of the particle: 


dx dz 
Pe Gps A E 


and the magnitude of the velocity vector, sometimes called the 
speed, is 


1.3.4 In rectilinear motion of a particle, the direction of its 
velocity vector remains constant. The motion of a particle is 
said to be uniform if the magnitude of its velocity does not 
change with time: v = ds/dt = const. The path length s tra- 
velled by a uniformly moving particle is a linear function of 
time: s = vt (provided that ż = 0, see Sect. 1.2.5). 

If the magnitude of the particle’s velocity, i.e. its speed, increa- 
ses with time (dv/dt > 0), the motion is said to be accelerated; 
if the speed decreases with time (dv/dt < 0), the motion is said 
to be decelerated. 

1.3.5 The average velocity of nonuniform motion of a particle 
over a given portion of its path is the scalar vay, called the 
travelling speed, which is equal.to the ratio of the length As of 
this portion to the length At of the time interval required to 
. travel over this portion: 


As 


a 


It is equal to the speed of uniform motion in which the time 
required to travel the same path length As is the same as in 
the nonuniform motion being considered. 

In curvilinear motion of a particle | Ar | < As. Therefore, in 
the general case, the average velocity (travelling speed) Vay 
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of the particle is not equal to the magnitude of its average 
velocity Vay over the same portion of its path (4.3.1): vay > 
=> | Vay |, where the equality sign corresponds to a rectilinear 
portion of the path. 

1.3.6 In the case of plane motion of a particle M (4.2.3), it 
often proves convenient to use the polar coordinates r and 9g, 
where r is the distance from pole 
O to particle M and @ is the vec- 
torial angle measured from polar 
axis OA (Fig. 1.3). The velocity 
v of particle M can be resolved 
into two mutually perpendicular 
components: the radial velocity v, 
and the transverse velocity Vo: 


V = Vr + Vos 


in which 


Fig. 1.3 4dr _ dg 
camer arn and vo Gr [kr], 


where r is the polar radius vector of particle M, and k is the 
unit vector directed perpendicular to the plane of motion of 
the particle so that the rotation of radius vector r upon an 
increase in angle @ is seen, from the head of unit vector k, to 
be counterclockwise. 

The magnitude v of the velocity vector of particle M, in plane 
motion, is 


= V (ar) + (ra) - 


During the infinitely short length of time di, polar radius 
vector r of the particle in plane motion sweeps over a sector of a 
circle, the area of the sector being dS = r*dq/2. Hence, the 
quantity 


dS 1 dọ 4 
Deas SRE, ry PEE ee 
d 2° u 2 o 


is called the areal velocity. 
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1.4 Acceleration 


1.4.1 The concept of acceleration is introduced in mechanics to 
characterize thé rate of change of the velocity vector of a mov- 
ing particle. The average acceleration in the time interval from 
/ to t + At isthe vector aay, which is equal to the ratio of the 
increment Av of the velocity vector of the particle, during the 
time interval, to the length of the time interval At: 


Av 
Mav—=TE 


1.4.2 Acceleration (or instantaneous acceleration) of a moving 
particle is the vector quantity a, equal to the first time deriv- 
ative of the velocity v of the particle being considered or, other- 
wise, the second time derivative of the particle’s radius vector r: 


_ Wve 
~ at dt? * 


The acceleration of a moving particle at the instant of time t 
is the limiting value that the average acceleration ay ap- 
proaches for an infinitely small time interval At. Thus 


í Av : 
a= lim v——_— =< lim aav- 
At > 0 Al At +0 


1.4.3 The resolution of vector a along the basis of a rectangular 
Cartesian coordinate system is of the form: 


a = ayi + a,j + ak. 


The projections of the acceleration vector of a moving particle 
on the coordinate axes are equal to the first time derivatives of 
the corresponding velocity projections or, otherwise, the second 
a derivatives of the corresponding coordinates of the- par- 
ticle: 


PEN dvy Qx ; dvy _ @?y _ dv; dz 
x= diz? “UG ae 72 ht dee’ 
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The magnitude of the acceleration vector is 
= dv, \? dvy \? dv, J 
o=lal=V | ra a ne) 


ra dx \2 d2y \" d2z j 
=} (aa) Ee HA i 
1.4.4 The acceleration vector of a moving particle lies in the 
osculating plane (1.2.4) passing through point M of the path 
being considered and is direct- 
M r ed toward the concavity of the 
Oz path BC (Fig. 1.4). In this plane 
n the acceleration vector a can 
| C be resolved into two mutually 
----- a perpendicular components a, 
and an: 


a= az,-+ ap. 


1.4.5 Component ar is called the tangential acceleration of the 
moving particle. It is directed along the tangent to the path 
and equals 

dv dv 


ap T and ar =—— 


Fig. 1.4 


where t = v/v is the unit vector of the tangent, from point M 
of the path in the direction of velocity v of the particle, and a, 
is the projection of the tangential acceleration on the direction 
of vector v. The tangential acceleration characterizes the rate 
of change of the magnitude of the particle’s velocity vector. 
Vectors a; and v coincide in direction, i.e. az > 0, in accelerat- 
ed motion (1.3.4); vectors a; and v are opposite in direction 
i.e. ay < 0, in decelerated motion of the particle; and a, = 0 
in its uniform motion. If a, = const Æ 0, the motion is said to 
be uniformly variable. In uniformly variable motion the magni- 
tude of the particle’s velocity is linearly time-dependent: 


V = Ug F axl, 
where v = v (0) is the magnitude of the initial velocity, i.e. 


the velocity at the initial instant of time ¢ = 0. Ifa, = const > 
> U, the motion of the particle is said to be uniformly acceler- 
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ated, whereas if a, = const < 0, it is said to be uniformly de- 
celerated. 

1.4.6 Component a, of acceleration a of the moving particle is 
called its normal acceleration. It is directed along the principal 
normal to the path at point M being considered, toward the 
centre of curvature (1.2.4) of the path. Therefore, a, is often 
called the centripetal acceleration of the particle. The normal 
acceleration equals 


y2 
R 
where n is the unit vector of the principal normal, and R is the 
radius of curvature of the path. The normal acceleration charac- 
terizes the rate of change in direction of the particle’s velocity 
vector. 

If the particle travels in a straight line the normal acceleration 
a,, = 0 and the acceleration is equal to the tangential accele- 
ration of the particle: a = aj. 


n, 


an = 


1.5 Translational and Rotary Motion of a Rigid Body “` 


1.5.1 Translation is the kind of motion of a rigid body in which 
any straight line rigidly fixed in the body (for instance, line 
AB in Fig. 1.5), remains paral- 
lel to its initial position (A80). 
With respect to the earth, a lift 
cabin (elevator car), lathe tool, 
and many other items have 
translational motion. 

In the translational motion of a 
rigid body all of its points are 
displaced in an identical way. 
During the infinitely short time 
di the radius vectors of these 
points are changed by the same ___. 
amount dr. Hence the velocity tig. 1.5 

of all points of the body is the 

same in each instant of time and is equal to dr/dt and, conse- 
quently, their acceleration is also the same. For this reason, 


* Only perfectly rigid bodies are dealt with in this section; for 
the sake of brevity, they have been called simply “rigid bodies”. 
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a kinematic discussion of the translational motion of a rigid 
body reduces to the investigation of the motion of any one of 
its points, or particles. Dynamics usually deals with the motion 
of the centre of mass of the body (2.3.3). A rigid body travel- 
ling freely in space has three translational degrees of freedom 
(1.2.7), corresponding to its translational motions along the 
three coordinate axes. 
1.5.2 The motion of a rigid body in which two of its points, A 
and B, remain fixed is called rotation (or rotary motion) of the 
body about a fired axis. The fixed straight line AB is the axis 
of rotation. In the rotation of a rigid body about a fixed axis 
all the points of the body describe circles whose centres lie on 
the axis of rotation and whose planes are perpendicular to this 
axis. The rotors of turbines, motors and generators, for exam- 
ple, have forar motion with respect to the earth if these ma- 
chines are rigidly mounted. 
A rigid body rotating about a fixed axis has only one degree of 
freedom (1.2.7). Its position in space can be completely defined 
by specifying the angle of rotation (position angle) ọ from some 
definite Caina position. 
1.5.3 The angular velocity serves to characterize the rate and 
direction of rotation of a body about an axis. The angular veloc- 
ity is the vector œ, which is equal in magnitude to the first 
derivative of the angle of rotation ọ with respect to time 1 
and is directed along the fixed axis of rotation in such a way 
that from its head the rotation of the body is counterclockwise 
(Fig. 1.6)*: 

__ dy _ dp. 
omar Ne OS 
* The direction of vector œ can also be determined by the 
right-hand screw rule. It coincides with the direction of transla- 
tional motion of a right-hand screw rotating together with the 
body. Vectors like œ, whose direction is related to the direction 
of rotation and is reversed in transforming from a right-handed 
to a left-handed system of coordinates, are called pseudovectors, 
or axial vectors (in distinction to ordinary, polar vectors, which 
are not reversed in the above-mentioned transformation of coor- 
dinates). For example, the vector product of two polar vectors 
is a pseudovector, and that of a pseudovector and a polar vector 
is a polar vector. 
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llere dọ is the vector of an element of fotation of the body dur- 
ing the time dt. This vector is directed along the axis of rota- 
tion, also according to the right-hand screw rule (Fig. 1.6). 
Axial vectors dp and œ have no definite points of application: 
they can be laid off from any point on the axis of rotation. 
In Fig. 4.6 they are laid off from a certain point O of the fixed 


Fig. 1.6 Fig. 4.7 


axis of rotation. This point is also taken as the origin of the 
reference frame. 

Rotation of a body is said to be uniform if the magnitude of its 
angular velocity remains constant with time w= const. In 
this case the angle of rotation of the body is linearly dependent 
on time: pọ = ot. 

1.5.4 An arbitrary point M of a rigid body rotating about fixed 
axis OZ with the angular velocity @ describes a circle of radius 
0 with its centre at point O’ (Fig. 1.7). As distinct from the 
angular velocity of the body, the velocity v of point M is often 
called the linear velocity. It is perpendicular both to the axis 
of rotation (i.e. to vector œ) and to radius vector p, from the 


39 E | KINEMATICS [1 


_centre O’ of the circle to point M, and is equal to their vector 
product: 


= [op] = [or) and v = op. 


Here r = OO’ + pọ and is the radius vector of point M, from 
point O of the axis of rotation. Point O is taken as the origin of 
coordinates. l 
1.5.5 The period of revolution is the time interval T required for 
a body, rotating uniformly at the angular velocity œ, to com- 
plete one revolution about its axis of rotation (i.e. to rotate 
through the angle ọ = 27): T = 27/0. - 
The rotational speed n = 1/T = w/2n is the number of revolu- 
tions made by a body in unit time in uniform rotation at the 
angular velocity œ. 

1.5.6 The rotation of a rigid body in which one of its points 
remains fixed is said to be rotation about a fixed point. This 
point is usually taken as the origin of the fixed reference frame. 
In such rotation, all the points of the body travel along the 
surfaces of concentric spheres whose centre is at the fixed point. 
At each instant of time, this motion of the rigid body can be 
regarded as rotation about a certain axis that passes through 
the fixed point and is called the instantaneous axis of rotation. 
In the general case, the position of the instantaneous axis of 
rotation varies with respect to both a fixed reference frame and 
one rigidly fixed in the rotating body. 

The velocity v of arbitrary point M of such a body is 


v = [or] and v= wp. 


Here œ = d@q/dt is the angular velocity of the body and is di- 
rected along the instantaneous axis of rotation as is the vector 
dọ of an element of rotation of the body during the infinitely 
short time dt, r is the radius vector from fixed point O, about 
which the body rotates, to point M, and p is the distance from 
point M to the instantaneous axis of rotation. Such a body can 
have three independent motions: it can rotate about each of 
three mutually perpendicular axes passing through fixed point 
O. It consequently has three degrees of freedom (1.2.7). 

1.5.7 The angular acceleration vector e is introduced to indicate 
the rate of change of the angular velocity vector with time in 
nonuniform rotation of a body about a fixed axis or its rotation 
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about a fixed point. Vector e is sue: the first derivative of 
the angular velocity œ of the body with respect to the time t: 


dw 


lf the body rotates about a fixed axis, vector e is directed along 
this axis. It coincides in direction with @ in accelerated rota- 
Lion (dw/dt > 0) and is opposite to œ in direction in decelerated 
rotation (dw/dt < 0). The projection of the angular acceleration 
on the fixed axis OZ of rotation is equal to 


where œ, is the projection of vector œ on the same axis. 

1.5.8 As distinct from the angular acceleration, the accelera- 
tion a of arbitrary point M of a rigid body rotating about fixed 
point O, or about a fixed axis passing through point O, is often 
called the linear acceleration. It equals 


d 
eS ae [or] = ayr+aax, 


where a, = [er] is the rotary acceleration of the point, and 
aay = [@ lor]] is its azipetal acceleration, which is directed 
toward the instantaneous axis of rotation. 

lf the body rotates about the fixed axis OZ (Fig. 1.7), the rotary 
acceleration of point M coincides with its tangential accelera- 
tion a; (1.4.5), and its axipetal acceleration coincides with its 
normal acceleration an (1.4.6): 


ar = [er] = [ep] and an = —o?p. 


1.5.9 Any complex motion of a rigid body can be broken down 
into two simple motions: translational motion at the velocity 
va of a certain arbitrarily chosen point A of the body and rota- 
tion about an instantaneous axis passing through this point. The 
angular velocity œ of rotation is independent of the choice of 
point A. The velocity of arbitrary point M of the body is 


v= v4 + [o (r — ry)], 
where r and r, are the radius vectors of points M and A. 
3—0186 
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In the dynamics of a rigid body it usually proves convenient to 
deal with its complex motion as the combination of two simul- 
taneous motions: translational motion at the velocity of the 
body’s centre of mass (2.3.3) and rotation about the centre of 
mass. 

The simplest case of complex motion of a body is plane, or 
plane-parallel, motion, in which all the points of the body trav- 
el in parallel planes. This is the motion, for instance, of a 
homogeneous circular cylinder rolling down an inclined plane. 
In plane motion the direction of the instantaneous axis of ro- 
tation of the body about point A remains constant and vectors 
@ and v4 are perpendicular to each other. 


CHAPTER 2 NEWTON'S LAWS 


2.1 Newton's First Law. Inertial Frames of Reference 


2.1.1 As the first law of dynamics Newton took a law that had 
already been established by Galileo: a particle persistsin its 
state of rest or uniform motion in a straight line until the action 
of other bodies compel it to change that state. 

Newton’s first law indicates that no external action is required 
to maintain a state of rest or of uniform motion in a straight 
line. This manifests a special dynamic property of bodies, 
called their inertia. Newton’s first law is consequently called 
the law of inertia, and the motion of a body not subject to the 
action of other bodies is said to be inertial motion. 

2.1.2 Mechanical motion is relative: its nature may differ for 
different frames of reference (1.2.1) that are moving in relation 
to each other. An astronaut, jor instance, on board a man-made 
satellite orbiting the earth, may be at rest in a reference frame 
fixed in the satellite. At the same time, he is travelling together 
with the satellite with respect to the earth along an elliptic or- 
bit, i.e. not with uniform motion and not in a straight line. It is- 
thus obvious ‘that Newton’s first law does not hold for.any frame 
of reference. A ball, for example, lying on the smooth floor 
of a stateroom in a ship travelling with uniform motion at sea 
in a straight line, may suddenly start moving along the floor 
without being subject to the action of any other bodies. This 
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will occur when the velocity of the ship, i.e. its speed and/or 
direction, begins to change. 

A frame of reference with respect to which a particle, not sub- 
ject to external action, remains at rest or moves uniformly in a 
straight line is called an inertial frame of reference. The content 
of Newton’s first law reduces, in essence, to two statements: 
in the first place, that all material bodies possess the property 
of inertia, and secondly, that inertial reference frames exist. 
2.1.3 Any two inertial reference frames can move only wilh 
translational motion with respect to each other, and only uni- 
lormly in a straight line. It has been experimentally established 
that the heliocentric frame of reference is practically inertial. The 
origin of its coordinate system is located at the centre of mass 
(2.3.3) of the solar system (approximately at the centre of the 
sun) and the three axes extend in the directions of three dis- 
lant stars selected, for example, so that these coordinate axes 
are mutually perpendicular. 

A laboratory frame of reference, whose coordinate axes are rigidly 
lixed in the earth, is noninertial chiefly owing to the earth's 
diurnal (daily) rotation. But the earth rotates so slowly that 
the maximum normal acceleration (1.4.6) of points on its sur- 
face due to diurnal rotation does not exceed 0.034 m/s*. There- 
fore, in the great majority of practical problems, the laboratory 
ae frame can be regarded as being approximately iner- 
lial. 

2.1.4 Inertial reference frames are of prime importance in all 
branches of physics, not only mechanics. This is due to the 
fact that according to Einstein’s principle of relativity (5.1.2) 
the mathematical expression of any physical law should have 
the same form in ali inertial reference frames. Hence, in the 
following we shall make use of only inertial frames of reference, 
without stipulation each time. The laws of motion of a particle 
with respect to a noninertial reference frame is dealt within 
Chapter 7. 


2.2 Force 


2.2.1 Force is a vector quantity which is a measure of the mechan- 
ical action exerted on the body being considered by other 
bodies. Mechanical interaction can be accomplished either be- 
tween bodies in direct contact (for instance, in friction, and in 
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the pressure of bodies against one another), or between remote 
bodies. A special form of matter, connecting particles of matter 
into unified systems and transmitting the action of certain par- 
ticles on others at finite velocities, is called a physical field, or 
simply field. Interaction between remote bodies is accomplished 
by the gravitational or electromagnetic fields they set up (for 
example, the attraction of the planets to the sun, and the inter- 
action of charged bodies or current-carrying conductors). Me- 
chanical action exerted on a given body by other bodies can be 
manifested in two ways. In the first place, it can lead to a change 
in the state of mechanical motion of the body being consid- 
ered, and secondly, to its deformation. Both of these effects can 
serve as the basis for measuring forces. For example, the mea- 
surement of forces by means of a spring dynamometer is based 
on Hooke’s law (40.3.4) for axial tension. 

Using the concept of force, we usually speak in mechanics of 
the motion and deformation of a body by the action of the forces 
applied to it. Here, of course, each force always corresponds 
to i certain body exerting this force on the body being consid- 
ered. 

Force F is completely specified by giving its magnitude, direc- 
tion in space and point of application. The straight line along 
which a force is exerted is called its line of action. 

A field acting on a particle with the force F is said to be steady, 
or stationary, if it does not change in the course of time ż, i.e. 
if force F is explicitly independent of time at any point in the 
field: oF/ot = VU. For a field to be steady itis necessary that 
the bodies which set it up are at rest with respect to the ref- 
erence frame used in considering the field. 

2.2.2 The simultaneous action of several forces, F4, Fə, ..., Fn 
(Fig. 2.4a) on a particle M is equivalent to the action of a single 
torce, called the resultant force and equal to the vector sum of 
the component forces. Thus 


a 
F= )) Fi. 

i=1 | 
It is the closing line of the polygon of forces F,, Fa, ..., Fn 


(Fig. 2.10). 
If a body is perfectly rigid, the action of a force on it doesnot 
change when the point of application is displaced along the 
line of action, but within the limits of the body. In other words, 
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lorces applied to perfectly rigid bodies can be regarded as non- 
localized vectors. as i 4 

2.2.3 A body is said to be free if no limitations are imposed on 
its position and motion in space. An airplane, for example, 
(lying in the air is a free body, as is a submarine travelling in 
the depths of the sea. Most bodies that we deal with are not 


Fig. 2.4 


free; certain limitations are imposed on their possible positions 
and motion. These limitations are called constraints in mechan- 
ics. A ball, for example, hung on a nonextensible thread, can- 
not move away from its point of suspension by more than the 
length of the thread; a streetcar (tram) must travel only along 
its tracks. Constraints are realized duc to the action on the 
body being considered exerted by other bodies attached to or 
contacting this body (for instance, the thread on the ball tied 
to it, the track on the streetcar, etc.). 

In investigating the behaviour of constrained bodies or systems 
of bodies in mechanics, the principle of releasability is applied. 
It states that a restrained body (or system of bodies) can be 
treated as a free body (or system) if the bodies imposing the 
constraints are replaced by the corresponding forces. These 
forces are called reaction forces of the constraints, or constraint 
reactions. All other forces exerted on the body are called active 
forces. Thus, the motion of the ball hung by the thread can be 
treated as the motion of a free ball subject, in addition to all 
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the active forces applied to it (for instance, gravity), to the 
reaction of the thread as well. 

In contrast to the active forces, which should be given in each 
specific problem, the constraint reactions are not known before- 
hand. They are to be determined in solving the problem. Their 
values should be such that under the combined effects of the 
active forces and constraint reactions, the motion of the “re- 
leased” body completely agrees with the limitations imposed by 
the constraints on the constrained body being considered. There. 
as no other distinctions between constraint reactions and active 
orces. 

2.2.4 Bodies not included in the mechanical system being in- 
vestigated are called external bodies. Forces exerted on the 
system by external bodies are called external forces.. Corres- 
pondingly, forces of interaction between the parts or particles 
of the system being considered are called internal forces. 

A mechanical system is said to be closed, or isolated, if it does 
not interact with external bodies. No external forces act on 
any of the bodies of a closed system. 


2.3 Mass. Momentum 


2.3.1 In classical (Newtonian) mechanics, the mass of a particle 
(material point) is a positive scalar quantity which is a measure 
of the inertia of this particle. Under the action of a force the 
velocity of a particle changes gradually, not instantaneously, 
i.e. it acquires an acceleration finite in magnitude and inversely 
proportional to the mass of the particle. To compare the masses, 
m; and ma, of two particles, it is sufficient to measure the 
magnitudes a; and a, of the accelerations acquired by these 
particles when the same force is applied to them: m/m; = 
= a/a. The mass of a body is ordinarily determined by weigh- 
ing by beam scales. 

Classical (Newtonian) mechanics contends that: | 
(a) the mass of a particle is independent of its state of motion, 
being its unalterable characteristic; 

(b) mass is an additive quantity, i.e. the mass of a system (for 
instance, a body) is equal. to the sum of the masses of all the 
particles making up the system; 

(c) the mass of an isolated (closed) system (2.2.4) dees not 
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change with any processes occurring within the system (law of 
conservation of mass). 

These propositions of Newtonian mechanics were reconsidered 
nnd revised in relativistic mechanics (5.6.1, 5.6.2, 5.7.3 and 
5.7.6). 

2.3.2 The density ọ of a body at a given point M is the ratio 
of the mass dm of an infinitely small element of the body, which 
includes point M, to the quantity dV, which is the volume of 
this element. Thus E É 


P=: 


The size of the element being considered should be so small 
that any variation of density within its limits can be neglected. 
On the other hand, this size should be a great many times larger 
than intermolecular distances. 7 
A body is said to be homogeneous if the density is constant at 
all of its points. The mass of a homogeneous body is the product 
of its density by its volume: m = pV. 

The mass of a nonhomogeneous body is 


m= ( 0 aV, 

(V) 
where p is a function of the coordinates, and ples Sie is 
carried out over the whole volume of the body. The average den- 
sity (0) of a nonhomogeneous body is the ratio of its. mass to 
its volume: (9) = m/V. 


2.3.3 The centre of inertia, or centre of mass, of a system of par- 
ticles is point C, whose radius vector ro equals 


n 

1 

To = a > Miri, 
i=1 


where m; and r; are the mass and radius vector of the ith par- 
ticle, n is the total number of particles in the system, and m = 


n 
= 5’ m; is the mass of the whole system, 
i=! | 
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The velocity of the centre of mass is 


2.3.4 The vector quantity p;, equal to the product of the mass 
m; of a particle by its velocity v;, is called the momentum of 
this particle. The momentum of a system of particles is the vector 
p, which is the geometric sum of the momenta of all the par- 
ticles making up the system: 


n 
p= >) Pi. 


i=1 


The momentum of a system is equal to the product of the mass 
of the whole system by the velocity of its centre of mass: p =. 
= MVC. 


2.4 Newton's Second Law 


2.4.1 The basic law of the dynamics of a particle is Newton’s 
second law, which discusses how the mechanical motion of a 
particle is changed by the effect of forces applied to it. New- 
ton’s second law states: the rate of change in the momentum p 
of a particle is equal to the force F acting’on it. Thus 


dp d = 
af or at (mv) =F, 


where m and v are the mass and velocity of the particle. 

If several forces act simultaneously on a particle, the force F 
in Newton's second law is understood to be the geometric sum 
of all acting forces, both active forces and constraint reactions 
(2.2.3), i.e. the resultant force (2.2.2). 

2.4.2 The vector quantity F dt is called an element of impulse 
of force F during the infinitely short time dt of its action. The 
impulse of force ¥ acting in the finite time interval from ¢ = tg 


4] NEWTON’S SECOND LAW 44 


to 
to t= t, is equal to the definite integral | F dt, where F, in 


ty 
the general case, depends upon the time ¢. 
According to Newton’s second law, the change in the momentum 
of a particle is equal to the impulse of the force acting on it: 


te 
dp-=Fdt and Ap=p.—pi= È Fat, 
ti 


where pa = p (t2) and pı = p (#,;) are the values of the momen- 
tum of a particle at the end (t = t.) and beginning (t = t) of 
lhe time interval being considered. 

2.4.3 Since in Newtonian mechanics the mass m of a particle is 
independent of its state of motion, dm/dt = 0. Hence, the 
mathematical expression of Newton’s second law can also be 
presented in the form 


a = m j 

where a = dv/dt = d?r/dié? is the acceleration of the particle 
and r is its radius vector. The corresponding statement of 
Newton’s second law reads: the acceleration of a particle coin- 
cides in direction with the force acting on it and is equal to the 
ratio of this force to the mass of the particle. 

The tangential and normal accelerations of a particle (1.4.4. 
1.4.5 and 1.4.6) are determined by the corresponding compo- 
nents of force ‘F: 


n F a dv Fr 
T m? T dtm? 

and 

a — En ap = LO 
n moto R m’? 


where v is the magnitude of the particle’s velocity vector, and 
R is the radius of curvature of its path. The force Fn, imparting 
normal acceleration to a particle, is directed toward the centre 
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of curvature (1.2.4) of the particle’s path and is therefore called 
the centripetal force. 

2.4.4 If a particle is subject to the simultaneous action of sev- 
eral forces, F,, F,, ..., Fn, its acceleration is 


n 


n 
1 
a=— >} F= als 
l=1 l=1 


where a; = F,/m. Consequently, each of the forces acting si- 
multaneously on a particle imparts to it the acceleration it 
would have if there were no other forces (principle of superposi- 
tion, or independence, of forces). 

The equation 


n 
dèr 
m -ga 7 F = > F; 
l=1 


is called the differential equation of motion of a particle. 
In projections on the axes of a rectangular Cartesian coordinate 
system, this equation is of the form 


d?r d?y d?z 
m-a =F x, m -ya = Fy and m qe 


= F,, 


where z, y and z are the coordinates of the moving particle. 


2.5 Newton's Third Law. Motion of the Centre of 
Mass 


2.5.1 The mechanical action exerted by bodies on one another 
is of the nature of their interaction. This is what Newton's third 
law deals with: two particles exert forces on each other which 
are equal in magnitude and opposite in direction along a line 
connecting these particles. 

If F;, is a force exerted on the ith particle by the kth one, and 
F,; is the force exerted on the kth particle by the ith one, then, 
according to Newton’s third law, | 


Fy; = —Fip- 
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Forces F;, and F,; are applied to different particles and can be 
mutually counterbalanced only when particles i and k belong 
to the same perfectly rigid body. 

2.5.2 Newton’s third law is an essential supplement to the 
first and second laws. It permits us to go over from the dynam- 
ics of a separate particle to that of an arbitrary mechanical 
system (system of particles). It follows from Newton’s third 
law that in any mechanical system the geometric sum of all 
the internal forces (2.2.4) equals zero: 


n n 
S S Fa=0 
i=1 k=1 


where n is the number of particles in the system, and F;; = 0. 
The vector Fext, equal to the vector sum of all the external 
forces (2.2.4) acting on the system, is called the principal vector 
of external forces: 


n 
— NVI pet 
Fext — 2 Fé 
= 


where pe is the resultant external force applied to the ith 


particle. 

2.5.3 It follows from Newton’s first and second laws that the 
first derivative with respect to the time ¢ of the momentum p 
of a mechanical system (2.3.4) is equal to the principal vector 
of all external forces applied to the system. Thus 


p 
——. — Fext 
it — F ° 


This equation expresses the law of the change in the momentum 
of a system. 

Since p = mvc, where m is the mass of a system and vọ is 
the velocity of its centre of inertia, or mass, the law of motion 
of the centre of mass of a mechanical system is of the form 


d 
-gr (Vc) = Fext or mag=Fext, 


where aç = dV,/di is the acceleration of the centre of mass. 
Thus, the centre of mass (centre of inertia) of a mechanical 
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system travels like a particle whose mass is equal to the mass 
of the whole system and which is subject to a force equal to 
the principal vector of the external forces applied to the system. 
If the system being considered is a rigid body that travels with 
translational motion (4.5.1), the velocities v; of all the points 
of the body and vç of its centre of mass are the same and are 
equal to velocity v of the body. Correspondingly, the accelera- 
tion of the body a = aç and the basic equation in the dynamics 
of translational motion of a rigid body is of the form 


ma = Fext, 


2.6 Motion of a Body of Variable Mass 


2.6.1 In Newtonian mechanics the mass of a body can be 
changed only if particles of matter are detached from or added 
to it. An example of such a body is a rocket. In its flight the mass 
of the rocket is gradually reduced because the gaseous products of 
combustion in the rocket engine are ejected through the noz- 
zles. 

The equation of translational motion of a body of variable mass 
(Meshchersky equation) is 

dv dm 


es — ext = aes 
Sage eh es 


where m and v are the mass and velocity of the body at the in- 
stant of time being considered, Fext is the principal vector of 
the external forces (2.5.2) acting on the body, and v, is the 
velocity of the detracted particles after separation from the 
rocket (if dm/dt < 0) or the added particles before being joined 
(if dm/dt > 0). 

2.6.2 The second member on the right-hand side of the Me- 
shchersky equation represents the additional force acting on a 
body of variable mass. It is called the reactive force: 


dm dm 
Be) a Us 
where u = v; —- v is the relative velocity of the particles being 
separated from or joined to the body, i.e. their velocity with 


respect to a reference frame in translational motion together 
with the bodv 
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The reactive force characterizes the mechanical action exerted 
on the body by the particles separated from or joined to it (for 
instance, the effect of the ejected exhaust gases on the rocket). 
2.6.3 The equation of motion of a rocket not subject to the 
aclion of external forces is 


av . dm 

dt dt ` 
If the initial velocity of the rocket is zero the rocket flies in a 
straight line in the direction opposite to that of the relative 
velocity u of the exhaust gases ejected from the engine nozzle. 
Iere 


and at u = const the relation between the velocity of the rocket 
and its mass is expressed by Tsiolkovsky’s formula 


m 
v=u In —, 
m 


where mọ is the initial, or launching, mass of the rocket. 

2.6.4 The maximum velocity that a rocket can reach in the 
absence of external forces is called its characteristic velocity. 
This velocity is acquired at the moment the rocket engine 
stops because it has used up all the fuel and oxidizer that were 


on board, 
Mo 


Pmax =u In Pa my 


where my is the initial mass of the fuel and oxidizer. 

The effects of the earth’s gravity and air resistance lead to an 
appreciable reduction in the maximum velocity actually ac- 
quired by the rocket, during the operation of its engine, as com- 
pared to the characteristic velocity. 

2.6.5 The characteristic velocity of a multistage (combination) 
rocket is 


n 
` Moi 
pmax >, u; In i 2 , 
Moi — Mfi 


i= 
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where n is the total number of stages of the rocket, mpz is the 
mass of the fuel and oxidizer intended for the-operation of the 
ith stage engine, u; is the exhaust gas velocity (nozzle velocity) 
of the ith stage engine, and mọ; is the launching mass of the 
multistage rocket, including all the stages from the ith to the 
nth. The velocity increase of a multistage rocket, compared to 
that of a single-stage one having the same launching mass and 
the same fuel and oxidizer capacity, is due to the reduction in 
the mass of the rocket as the first, second and subsequent stages 
are consecutively jettisoned when their fuel has been ex- 
hausted. 


2.7 Law of Conservation of Momentum 


2.7.1 The law of conservation of momentum is: the momentum p 
of a closed, or isolated, system does not change in the course 
of time, i.e. 


dp 
7 = 0 and p= const. 

In contrast to Newton’s laws, the law, or principle, of conser- 
vation of momentum is valid not only within the framework of 
classical mechanics. It belongs to the most fundamental of 
physical laws because it is concerned with a definite property 
of the symmetry of space: its homogeneity. The homogeneity 
of space is manifested in the fact that the physical properties 
of a closed system and the laws of its motion are independent 
of the choice of position of the origin of the inertial reference 
frame, i.e. these properties and laws are unchanged by a parallel 
translation of the closed system. as a whole in space. According 
to modern concepts, fields can also have momentum, not only 
particles and bodies. Light, for example, exerts a pressure on 
the surface of a body that reflects or absorbs it because the ele- 
ctromagnetic field of the light wave has momentum. 

2.7.2 In application to systems described by classical (Newto- 
nian) mechanics, the principle of conservation of momentum 
can be regarded as a consequence of Newton’s laws. For a closed 
mechanical system the principal vector of external forces 
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Fext = 0 and the principle of conservation of momentum follows 
from (2.5.3): 


n 
p= > m;V;=const, 

ix=1 
where m; and v; are the mass and velocity of the ith particle of 
the system, which consists of n particles. 
Consequently, there is also no change in the projections of the 
momentum of a closed system onto the axes of Cartesian coordi- 
nates of an inertial reference frame: | 


n 

Px = > MiYVix = Const, 
i=l 
n 

Py= >) mwiy = const, 


i=1 


nr 

Pz = >; MiViz = const. 

i=1 
The momentum of the system is p= mvc, where m is the mass 
of the system and vç is the velocity of its centre of mass (2.3.4). 
It follows, therefore, from the principle of conservation of mo- 
mentum that in any processes occurring within a closed system 
the velocity of its centre of mass does not change: vg = const. 
2.7.3 If a system is not closed, but the external forces acting 
on it are such that their principal vector is identically zero 
(Fext = 0), then, according to Newton’s laws (2.5.3), the mo- 
mentum of the system does not change with time: p = 
= const. 
As arule, Fext = 0 and p const. But if the projection of the 
principal vector of external forces onto any fixed axis is iden- 
tically zero, the projection onto the same axis of the momentum 
vector of the system does not change with time. Thus, p, = const 
under the condition that pert = 0. If, for instance, no other 
external forces, except gravity, are acting on a system, the 


horizontal component of the momentum of the system, perpen- 
dicular to the direction of the gravitational force, remains con- 


stant. 
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2.7.4 In some processes (e.g. collisions or in firing shells, etc.), 
the momenta of the parts of the system undergo great changes in 
relatively short time intervals. This is due to internal forces of 
interaction, of short duration, but of quite considerable magni- 
tude, developed within the system. In comparison with these 
forces, all external forces acting continuously on the system 
(for instance, gravity) are small. In such processes, the action 
of the system of external forces can usually be ignored, i.e. it 
can be assumed that the momentum of the system as a whole is 
not changed in the process being considered. 


2.8 Galilean Transformations. Mechanical Principle 
of Relativity 


2.8.1 A Galilean transformation is a transformation of coordi- 
nates and time that is applied in Newtonian mechanics in 
going over from one inertial frame of reference K (z, y, z, t) 
.to another K’ (z’, y’, 2’, t’) which travels with translational 
motion with respect to K at the constant velocity V. The Gali- 
lean transformation is based on axioms concerning the absolute 
nature of time intervals and of lengths. The first axiom con- 
tends that the passage of time (time interval between any two 
events) is the same in all inertial frames of reference. According 
to the second axiom, the dimensions of a body are independent 
of its velocity of motion with respect to the frame of ref- 
erence. 

If the like axes of Cartesian coordinates of inertial reference 
frames K and K’ are drawn parallel to one another pairwise 
and if at the initial instant of time (t = t’ = 0) the origins O 
and O’ of coordinates coincide with each other (Fig. 2.2), the 
Galilean transformations are of the form 


z = z — Vyt, y =y—V,t, 2 =z—V,t and =t 


r’ =r — Vt and t = 1, 


where z, y, z and zx’, y’, 2’ are the coordinates of point M in the 
frames of reference K (at the instant of time t) and K’ (at the 
instant of time ¢’ = t), r and r’ are the radius vectors of point 
M in the same frames of reference, and Vy, V, and V, are the 
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projections of the velocity V of reference frame K’ onto the 
coordinate axes of reference frame K. 

The coordinate axes are usually arranged so that frame of ref- 
erence K’ travels along the positive direction of axis OX 
(lig. 2.3). Under this condition the Galilean transformations 
are of their simplest form: 


t =V y’ =y, 2) =z and t’ =i. 


2.8.2 A consequence of the Galilean transformations is the 
following law for transforming the velocity of arbitrary point 


VK? 


Fig. 2.2 - Fig. 2.3 


M (Fig. 2.2) in going over from one inertial frame of reference 
K (the velocity of the point is v = dr/di) to another K’ (in 
which the velocity of the same point is v’ = dr'/dt): 


v =y yV, 


The projections of the velocity on like coordinate axes are 
transformed accordingly: 


Ver = Vy — Vy, v= vy— Vy and vz, = v, — V.. 


In particular, when reference frame K’ travels along the posi- 
tive direction of axis OX (Fig. 2.3) ` 


Ver = Vy — V, vy = Vy and v, = v,. 


4— 0186 
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The accelerations of point M in frames of reference K (a = 
= dv/dt) and K’ (a’ = dv‘/di) are the same: a’ = a. 

Hence, the acceleration of a particle is independent of the choice 
of the inertial frame of reference or, in other words, it is inva- 
riant with respect to Galilean transformations. 

2.8.3 The forces of interaction between particles depend only 
upon their relative position and their velocity with respect to 
one another. The relative position of any two particles, 2 and 
1, is represented by a vector equal to the difference between the 
radius vectors of the two particles, i.e. to the vector rə, = rg — 
—r, in reference frame K and to the vector rj, = r — r; in 
reference frame K’. It follows from the Galilean transformations 
that rj, = ra. Therefore, the distance between particles 1 and 2 
is the same in reference frames K and K’: 


T31 = ra Or (z3 — zi)? + (Y2 — yi)? + (z3 — z)? 
= (z3-— z1) + (ye — yi)? + (z2 — 21). 


The velocity of particle 2 with respect to particle 1 is equal to 
the difference of the velocities of these two particles: v — v, (in 
reference frame K) and v,—v; (in reference frame K’). It follows 
from the Galilean transformations that v, — vi = Ve — Vj. 
Thus, the relative position and the velocity of relative motion 
of any two particles are independent of the choice of inertial 
reference frame; they are invariant under a Galilean transfor- 
mation. Accordingly, the forces acting on a particle are also 
invariant under a Galilean transformation: F’ = F. 

2.8.4 Equations expressing Newton’s laws (2.4.3 and 2.5.1) are 
invariant with respect to Galilean transformations, i.e. their 
form is unchanged in transforming the coordinates and time 
from one inertial frame of reference (K) to another (K’): 


ma = F and F;; = —F;, (in reference frame K), 
m’a’ = F’ and Fh; = —Fin (in reference frame K’), 


where m’ = m is the mass of the particle being considered and is 
the same in all frames of reference. 

Thus, the mechanical principle of relativity (Galileo’s principle 
of relativity) is valid in classical mechanics. It contends that 
the laws of mechanics are the same in all inertial frames of ref- 
erence. This implies that under the same conditions all mechan- 
ical processes proceed in the same way in different frames of 
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reference. Consequently, it is impossible to establish, by means 
of any mechanical experiments conducted within a closed sys- 
lem of bodies, whether the system is at rest or is travelling at 
uniform velocity in a straight line (with respect to any inertial 
frame of reference whatsoever). 

The mechanical principle of relativity points to the fact that. 
all inertial frames of reference are equally valid in mechanics. 
It is impossible to single out any special, or “main”, inertial 
frame of reference with respect to which the motion of bodies 
could be regarded as “absolute motion”. 

2.8.5 The principle of relativity was generalized to includė all 
physical phenomena by Albert Einstein in his special theory of 
relativity (5.1.2). It was found that coordinates and time in 
various inertial frames of reference are related by the Lorentz 
transformations (5.3.2) rather than the Galilean transforma- 
tions. But at low velocities of relative motion of the frames of 
reference (in comparison to the velocity of light in free space), 
the Lorentz transformations become the Galilean transforma- 
tions. 


CHAPTER 3 WORK AND MECHANICAL ENERGY 


3.1 Energy, Work and Power 


3.1.1 Energy is a scalar physical quantity, which is a common 
measure of the various forms of motion of matter dealt with in 
physics. The energy of a system quantitatively characterizes 
this system with respect to the conversions of motion that are 
possible within it. These conversions take place due to the in- 
teraction between parts of the system, both with one another 
and with external bodies (the environment). The concept of 
various forms of energy has been introduced in physics to ana- 
lyze the qualitatively different forms of motion and the interac- 
tions corresponding to them. These include: mechanical (3.4.4), 
internal (9.1.2), electromagnetic (31.2.4), nuclear (42.2.2) and 
various other forms of energy. 

3.1.2 The mechanical motion of a body is changed by forces 
exerted on the body by other bodies. Used in mechanics for a 


4k 
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quantitative description of such a process of energy exchange 
between interacting bodies is the concept of the work done by 
the forces applied to the body being considered. An element of 
work done by force F in the infinitely small displacement dr is 
the scalar quantity 


SW = F dr = Fv dt, 


where r and v = dr/dt are the radius vector and velocity of the 
point of the body at which the force is applied, and dt is the 
infinitely short time interval during which force F performs the 
work ôW (the meaning of the notation 6W is given in 3.4.8). 
In rectangular Cartesian coordinates 


OW = Fy dx + Fy dy + Fz dz = (Fv + Fyvy + Fzv;) dt, 


where z, y and z are the coordinates of the point of application 
of the force, and Fy, Fy, Fz and vy, vy, vz are the projections of 
vectors F and v on the coordinate axes. 

3.1.3 The expression for an element of work can also be pre- 
sented in the form 


ôW = F ds cos œ = Fr ds, 


where ds = | dr | is an element of the length of the path of the 
point of force application during the infinitely short time in- 
terval dt, œ is the angle between vectors F and dr, and Fr = 
= F cosa is the projection of the force in the direction of the 
displacement dr. A force normal to the path of its point of appli- 
cation performs no work. 

Force F is called the motive force if Fx > 0, so that 5W > 0. 
If Fı < 0 (so that ÔW < 0), force F is called the retarding force 
(force of resistance). 

3.1.4 If a mechanical system is subject to the simultaneous 
action of forces F}, Fa, ..., Fn, the work 6W done by them 
during the infinitely short time dt is equal to the algebraic sum 
of the work done during the same time dt by each of the forces 
separately. Thus 


n n n 
ôW = > ôW; = >, F; dr; = > F;v; dt, 
i=t i=1 i=l 


where r; and v; are the radius vector and velocity of the point 
of application of force F;. 


3.4] ENERGY, WORK AND POWER 53 


For example, for a particle r; = r is the radius vector of the 
particle and v; = v is its velocity. Accordingly, 6W = F dr = 


n 
= Fv dt, where F = > F, is the resultant force (2.2.2). It 
i=1 
follows from Newton's second law (2.4.1) that for a particle 


SW = v ap, 

where p = mv is the momentum of the particle, and m is its 

mass. 

In the translational motion of a perfectly rigid body dr; = 
= dro and v; = Vc, where rc and vç are the radius vector and 
velocity of the centre of mass (2.3.3) of the body. 

The work done by internal forces in any motion of a perfectly 
rigid body equals zero. Hence, in the translational motion of 

such a body ôW = Fext dro = Fextvo dt, where Fext is the 
rincipal vector of external forces (2.5.2). It follows from the 
aw of motion of the centre of mass (2.5.3) that 


SW = Vc ap, 


where p = mvc is the momentum of a rigid body of mass m 
travelling at the velocity v = vg. 

3.1.5 The work W done by force F along a finite portion of the 
path Z of its point of application is equal to the algebraic sum 
of the work done over all the infinitely small portions of this 
path, i.e. it is expressed by the line integral 


. 8 
W = | F ar— | F.ds, 
(L) 0 


where s is the length of arc measured off along the path from 
the beginning of the portion of the path being considered, and 
F+ is the projection of the force onto the direction of displace- 
ment dr of the point of force application. To calculate this 
integral it is necessary to know the dependence of F, ons along 
the given path L. If this dependence is represented graphically 
(Fig. 3.1), the required work W is proportional to the hatched 
area in Fig. 3,1. 

3.1.6 Potential-field (conservative) forces are ones whose work 
depends only on the initial and final positions of their points 
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of application; it depends neither on the shape of the path of 
these points nor on their laws of motion along this path. 

For example, the forces of interaction between the parts (i.e. 
particles) of a system are said to be potential (conservative) 
forces if they depend only upon the configuration of the system, 
i.e. on the relative position of all its particles. The work done 
by these forces in moving the system from one arbitrary position 
to another is independent of the kind of displacement, but is 


Fr ' 7 
yp a 
WA 
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Fig. 3.4 Fig. 3.2 


completely determined by the initial and final configurations 
of the system. Examples are the forces of electrostatic and grav- 
itational interaction. 

A steady (stationary) field (2.2.1) is said to be potential, or con- 
servative, if force F, which it exerts on a particle located in 
the field, is a potential, or conservative force. This implies 
that force F depends only upon the position of the particle in 
the field, and that the amounts of work done by force F in 
displacing the particle from one arbitrary position (1) to ano- 
ther (2) (Fig. 3.2) along any two paths, for instance, fa2 (work 
Wig2z) and 1b2 (work Wipz), are the same: 


2. 
W ia =W m= \ F dr. 
1 


Accordingly, the work done by a conservative force in displac- 
ing its point of application along any closed path L (for exam- 
ple, Ja2b1) is equal to zero: 

Q F dr = 0. 

(L) 
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In the general case, the external bodies that set up the field 
being considered can be in motion with respect to an inertial 
frame of reference, so that their field is not stationary, i.e. force 


F depends explicitly on time: 6F/dt == 0. A nonstationary field 
is a conservative one if the work done by force F upon instanta- 
neous displacement of its point of application along any closed 
path L equals zero: 


F dr =0. 
) 


llere F depends, not only on the coordinates of the point of appli- 
cation, but on the time as well. To calculate this integral, how: 
ever, time is assumed to be a fixed parameter. 

3.1.7 Among the nonconservative forces are dissipative and 
gyroscopic forces. Dissipative forces are ones whose total work 
done in displacing a closed system in any way is always nega- 
tive. Such, for instance, are the forces of sliding friction and 
the forces retarding the motion of bodies in liquids and gases. 
In contrast to conservative forces, dissipative forces depend, 
not only on the relative positions of the interacting bodies, but 
on their relative velocities as well. 

(ryroscopic forces are ones that depend upon the velocity of the 
particle on which they act. They are directed perpendicular to 
this velocity. An example is the Lorentz force (24.1.1) exerted 
by a magnetic field on a charged particle travelling in this 
field. The work done by gyroscopic forces is always equal to 
zero, regardless of how the particle is displaced. 

A mechanical system (system of particles) is said to be conser- 
vative if all the internal forces in it are conservative ones and 
all the external forces are both conservative and stationary. 
Systems that do not satisfy these conditions are said to be non- 
conservative. 

3.1.8 An element of work done by force F, exerted on a particle 
by a stationary conservative field, can be represented in the 
form of an exact differential of a scalar function ® (x, y, z) of 
the coordinates. This is called the force function of the field: 


Fdr=d® or Fy dz-+Fydy+F, dz 
ao 


oD aD 
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ðD ôD oD 
F =m Lim Bane and F = grad Q. 
These last relations are also valid for a nonstationary conser- 
vative field, whose force function depends, not only on the 
coordinates, but on time as well: @ = @® (a, y, z, t). But here 


ôD 


An element of work done by a nonconservative force cannot be 
represented in the form of an exact differential of any function 
whatsoever of the coordinates. This is precisely why an element 
of work done by an arbitrary force is denoted by ôW. 
3.1.9 The concept of power is used in mechanics to characterize 
the amount of work done in unit time. Power (instantaneous pow- 
er) is the scalar physical quantity P. It is equal to the ratio 
of the element of work 6W to the infinitely short time interval 
dt during which it is performed: | 

ôW 
da dt ° 
If F is the force doing the work 6W, the power is the scalar pro- 
duct of force F by the velocity v of its point of application: 


P= Fv = Fyv. 


In the general case, power may vary with time. 

The average power in the time interval from ¢ to t + At is the 
physical quantity (P), which is equal to the ratio of the work 
A during this time interval to the length of this inter- 
val At: 


P=. 


3.2 Kinetic Energy 


3.2.1 The kinetic energy of a body is the energy of its mechanical 
motion. A change in the kinetic energy Ex of a particle due to 
the action of force F is equal to the work done by this force: 


dE, = ÔW = v dp, 
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where p = mv is the momentum of the particle, and m and v 
are its mass and velocity. In Newtonian mechanics m = const 
and the expression for the kinetic energy of a particle is of the 
form 
mv? mv? 
Eg= = 
42 2 


Kinetic energy in relativistic mechanics is discussed in Sect. 
Delle 

3.2.2 The kinetic energy of a mechanical system is equal to the 
sum of the kinetic energies of all the parts of the system. For a 


system consisting of n particles, for instance, 


n n 

1 miyi y miv; 
E TEN m 
Sle 
i=1 


i=1 


where m; and v, are the mass and velocity of the ith particle in 
the system. 
The kinetic energy of a body is 


Ex= \ ov? av = | ov? dV, 
(V) (V) 
where v is the velocity of points of the infinitely small element 
of volume dV of the body, whose density is ọ and mass is dm = 
= odV. Integrating is carried out throughout the whole volume 
V. of the body. If a perfectly rigid body of mass m travels with 
translational motion at the velocity v, its kinetic energy is 
Ey = mv*/2. The kinetic energy of rotating bodies is discussed 
in Sects. 4.3.3 and 4.3.5. 
3.2.3 The change in the kinetic energy of a mechanical system 
is equal to the algebraic sum of the amounts of work done by all 
ee external and internal forces acting on the system (2.2.4). 
Thus 


dk, = 6Wext + Wint, 
For example, for a system consisting of n particles 


n n n 
dE = Pa pert dr; +- >, > Fik dri, 
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where r; is the radius vector of the ith particle, Fext is the re- 
sultant of the external forces acting on this particle, and F;; = 


If the system is not deformed, then the work done by the inter- 
nal forces 


dwint = 0 and dE, = 6Wwext. 


For example, the change in the kinetic energy of a perfectly 
rigid body in translational motion is 


di, = Fextdr, 


where Fext is the principal vector of external forces (2.5.2), and 
dr is the vector of an element of displacement of the body. 
3.2.4 The kinetic energy of a mechanical system depends upon 
the choice of the frame of reference. If in the inertial reference 
frame K the kinetic energy of the system equals Ex, and in the 
reference frame K’, travelling with translational motion at the 
velocity V with respect to K, it is equal to Ei, then 


, , mv? ; 
Ey= k b> +P V, 


where m is the mass of the system, p’ = mvọ is the momentum 


of the system in its motion with respect to reference frame K’, 
and vç is the velocity of the system’s centre of mass with re- 
spect to K’. This equation is valid both for the case when V = 
= const, i.e. when XK’ is an inertial frame of reference, as well 
as for dV/dt Æ 0. 

In particular, if the frame of reference K’ travels with transla- 
tional motion with respect to reference frame K at the velocity 
Vo of m centre of mass of the system, i.e. V = vç, then vg = 
= 0 an | 


mv? ; 
z £ F Ek e 
This equation is an expression of Koenig’s theorem: the kinetic 
energy of a mechanical system is equal to the sum of the kinetic 
energy that a particle would have if its mass was that of the 
whole system and if it travelled at the velocity of the system’s 
centre of mass, plus the kinetic energy of the same system in its 
motion with respect to a translational reference frame (i.e. one 


E= 
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travelling with translational motion) having its origin at the 
centre of mass. 

It follows from Koenig’s theorem that the kinetic energy of a 
perfectly rigid body is equal to the kinetic energy of translation- 
al motion of this body at the velocity of its centre of mass plus 
the kinetic energy of rotation of the body about its centre of 
mass. 


3.3 Potential Energy 


3.3.1 Potential energy is the part of the energy of a mechanical 
system that depends only on its configuration, i.e. on the rela- 
live arrangement of all the particles of the system and their 
positions in the external potential (conservative) field (3.1.6). 
The decrease in the potential energy when the system is moved 
from arbitrary position 4 to another arbitrary position 2 is 
measured by the work W> done in this displacement by the 
(O TEORYATENE forces (internal and external) acting on the system. 
Thus 


Ep (A) — Ep (2) = Wiz, 


where E,, (1) and Ep (2) are the values of the potential energy of 
the system in its initial and final positions. Accordingly, the 
work done by conservative forces upon an infinitely small 
change in the configuration of the system is ôÔW = —dE). 

Note. It is assumed here that the external conservative forces 
are stationary (steady), i.e. they can change with time only as 
the consequence of a change of the position of the system being 
considered with respect to the frame of reference. Otherwise 


Ey 


aE dt. 


dEy=—8W+ 


In the simplest case, when the system is a particle located in a 
conservative field, the relation between force F, acting on the 
particle, and the potential energy Æp of this particle in the 
field is of the form 


OE p dE» dE p 


—— es 4 =S a T —— 


“o osr? YO y? 7” &z 
and F= —grad Ep. 
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The potential energy Ey of a particle is related to the force 
function (3.1.8) of the corresponding conservative field by the 
equation 


dE, = —d® or Ep (z, y, z, t) = —Ọ (z, y, z, th + C, 


where C is the integration constant. 

3.3.2 The equations in 3.3.4 enable the dependence of the po- 
tential energy of a system on its configuration to be determined 
only to an arbitrary constant term that has no effect on the 
change in energy. To obtain a unique dependence of the potenti- 
al energy of the system on its configuration, a so-called zero 
configuration is chosen, in which the potential energy of the 
system is conditionally assumed to equal zero. Thus, the po- 
tential energy of a system in an arbitrary state is equal to the 
work done by all the conservative forces acting on the system 
in transforming it from the state being considered to the state 
corresponding to the zero configuration. 

3.3.3 Example í. Potential energy of a particle in a uniform 
(homogeneous) force field. Assume that force F, exerted on the 
particle by the field, is directed along axis OZ, i.e. F = Fk, 
where k is the unit vector of axis OZ, and the projection F, 
of force F onto axis OZ is independent of the coordinates of 
the particle. Then 


dE, = —F dr = —F, dz and Ey (2) = —F,z + Ep (0), 


where EF, (0) is the value of the potential energy of the particle 
at the level z = 0 

In particular, the potential energy of a particle of mass m lo- 
cated in a uniform gravity field at the earth’s surface (axis OZ 
is directed vertically upward, F, = —mg, where g is the free 
fall acceleration) is equal to 


Ey (2) = mgz + Ep (0). 


3.3.4 Example 2. Potential energy of a particle in a central force 
field. In a conservative field of central forces the particle is sub- 
ject to the action of forces F which are directed everywhere 
along straight lines passing through a fixed point, called the 
centre of force, and depend only on the distance r to this point: 


r 
P= F;(r)—. 
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liere r is the radius vector from the centre of force to the point 
being considered in the field, and Fry (r) is the projection of force 
l onto the direction of vector r; it depends only on the distance 
r. If the particle is attracted to the centre of force, Fr (r) = 

—|F | < 0, but if it is repelled from the centre of force, 
then Fr (r) = | F | > 0. An element of work done by force F is 


SW = F dr = Fr (r) dr. 


The potential energy of the particle is 


oO 


My (r) = \ F, (r) dr+ Ep (00). 


Usually the energy of a particle, located at an infinite distance 
from the centre of force, is taken as the reference point for po- 
lential energy, i.e. it is as- 

sumed that Æp (œ) = 0. Thus Fi 


oo 


Ey r)= | F, (r) dr. 


r 


Ixamples of a central force field, 
in which the force is inversely 
proportional to the square of 
the distance to the centre of 
iorce (Fr (r)ocr-*), are the grav- 
ftational fields of a particle 
or a homogeneous sphere, the Fis. 3.3 

clectrostatic field of a point oe 

charge, as well as that of a 

hollow sphere and a solid sphere uniformly charged along the 
surface and throughout the volume, respectively. 

3.3.5 Example 3. Potential energy of a system of two particles 
between which central forces act, i.e. forces depending upon the 
distance between the particles and directed along the straight 
line joining the particles. Shown in Fig. 3.3 are mutually repul- 
sive forces F; and Fy, = —Fyio: 


For =F (p) -£ ’ 
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where pọ = r, — r; is the radius vector from particle 1 to par- 
ticle 2, and Fp (p) is the projection of force F,, on the direction 
of vector 9 and depends only on the distance p between the 
particles. An infinitely small change in the 

y potential energy of the system 


dEp = — (Fiz dry + Fo; dro) 
= —F2 dp = —Fp (p) dp. 


If we assume that Ep > 0 as p > oo, then 
Ep (p)= \ F 5 (ge) dp. 
p 


This energy is often called the mutual po- 
tential energy of two particles. 
Fic. 3.4 3.3.6 Example 4. Potential energy of an 
Beas elastic body (for instance, a spring) in azial 
tension or compression. When an elastic 
body is deformed, conservative internal forces (elastic forces) 
are developed in the body that oppose deformation. According 
to Hooke’s law, the force Fe) exerted by the deformed body A 
(Fig. 3.4) on body B, causing the deformation, is proportional 
to the amount of deformation (stretch or compression): 


Fa = —Azi. 


Here xi is the displacement vector of body B that characterizes 
the deformation of body A (in the undeformed state x = 0; in 
compression x > 0 and in tension x < 0), and k > 0 is a factor 
characterizing the elastic properties of body A. 

The potential energy of an elastically deformed body (in the 
absence of deformation, i.e. at z = 0, this energy is assumed 
equal to zero) 


kr? 
Ep = 9 & 
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3.4 Law of Conservation of Mechanical Energy 


3.4.1 The mechanical energy, or total mechanical energy, is the 
energy of mechanical motion and interaction. The mechanical 
energy E of a system of particles is equal to the sum of their 
kinetic energy Ey and the potential energy £, of interaction of 
these particles with one another and with external bodies: 


k = Ex + Ep 


An elementary increment of the mechanical energy of a system 
during the infinitely short time di is equal to 


N Ep 
dl = ôW none + DTI dt, 


where OWnonc is the algebraic sum of the elements of work 
done in the time interval dt by all the internal and external 
nonconservative forces. The term (dE,/dt) dt represents the 
change during the time interval dt in the potential energy of 
the system and, consequently, its total mechanical energy 
due to nonstationary external conservative forces (3.3.1). 
3.4.2 If the system is conservative (3.1.7), 6Wnone = 0 and 
ðE p/ðt = 0, because, in this case, there are no nonconservative 
forces and the external conservative forces are stationary. Con- 
sequently, the mechanical energy of such a system E = const, 
i.e. the following law, called the law of conservation of mechan- 
ical energy, is valid: in the motion of a conservative system, its 
mechanical energy remains unchanged. 

In. particular, this law holds for closed conservative systems: 
the mechanical energy of a closed system does not change with 
time if all the internal forces acting in this system are conser- 
vative ones. 

The law of conservation of mechanical energy is associated with 
the uniformity of time. This property of time is manifested in 
the fact that the laws of motion of a closed system (or a system 
located in a stationary external field) are independent of the 
choice of the zero time reference. For example, in the free fall 
of a body in a stationary conservative gravity field at the earth’s 
surface, the velocity of the body and the length of path travelled 
depend only on the length of time in free fall and initial vel- 
ocity of the body, but not on the specific instant of time the 
body began to fall. 
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3.4.3 The mechanical energy of a closed nonconservative system 
is changed owing to the work done by all the nonconservative 
internal forces: 


dE = 6Wnonc. 


Gyroscopic forces (3.1.7) do no work and make no contribution 
to 5Wnonc, i.e. the existence of such forces in a system docs 
not lead to changes in its mechanical energy. 

The action of dissipative forces (3.1.7), for instance, the force 
of friction, leads to a gradual reduction in the mechanical energy 
of a closed system. This process is called the dissipation of 
energy. Consequently, a system whose mechanical energy contin- 
ually decreases with time is called a dissipative system. In 
the dissipation of energy, the mechanical energy of the system 
is converted into other forms of energy (for example, into the 
energy of chaotic motion of molecules). The conversion of 
mechanical energy is accomplished in full agreement with 
a universal law of nature: the law of conservation of energy 
(9.2.7). | 

According to this law, energy can be converted from one form 
to another and redistributed within the system, but its total 
amount in a closed system must remain constant. It follows 
from the law of conservation and conversion of energy that the 
changes in energy of an open system, occurring in interaction 
of the system with the external medium (external bodies and 
fields), must be numerically equal and opposite in sign to the 
changes in the energy of the external medium. In other words, 
the change in the energy of a system upon its interaction with 
the external medium should be equal to the energy the system 
receives or yields up in the process being considered. 

3.4.4 Forces of resistance and friction act in all real mechanical 
systems. Hence, these systems are nonconservative. In certain 
cases, however, they can be assumed approximately conserva- 
tive, and the law of conservation of mechanical energy can be 
applied to them. Such an approach is feasible if, in the process 
being considered, the work Wnonc done by all the nonconserva- 
tive forces acting on the system is negligibly small in compari- 
son to the mechanical energy E of the system, i.e. | Wnonc/E | < 
<1, so that | AE/E | <1, where AE = Wnonc is the change 
in the mechanical energy of the system. 

3.4.5 The state of mechanical equilibrium of a system is one 
that can be disturbed only as a result of external force action 
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npplied to the system. In this equilibrium state all the particles 
ol the system are at rest so that the kinetic energy of the system 
aquals zero. The state of mechanical equilibrium is said to be 
stable if a small external action applied to the system leads to 
only a small change in its state. In this case, forces are devel- 
oped in the system that tend to restore the equilibrium state. 
A state of mechanical equilibrium is said to be unstable if it 
can be disturbed by even an infinitely small external action 
on the system, which does not return again to the equilibrium 
state. In this case, forces are developed in the system that tend 
to further disturb the equilibrium state. 

‘he law of conservation of mechanical energy enables the condi- 
tions of equilibrium of conservative systems to be formulated: 


Fig. 3.5 


in states of stable equilibrium the potential energy curve of 
(he system has minima, whereas in states of unstable equilib- 
rium, it has maxima. 

3.4.6 On the basis of the law of conservation of mechanical 
energy, the region of possible configurations of a conservative 
system (3.3.1) can be found. The kinetic energy of the system 
lk =œ 0. Hence, at a given value Æ of the mechanical energy 
of the system, the latter can only be in states that conform to 
the condition: Ey < E. Figure 3.5 illustrates the simplest case 
in which a particle has one-dimensional motion along axis OX 
in an external stationary conservative field. The potential 
energy of the particle is a function of only a single coordinate z, 
i.e. Ep = Ep (z). The curve of this relation, shown in Fig. 3.5, 


hb—0186 
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is called a potential energy curve. For a fixed value of the mech- 
anical energy E of the particle, as shown in Fig. 3.5, the particle 
can remain, in its travel, in one of the following three regions: 
x S z, (region I), xy S< z < z; (region III) and z > z, (re- 
gion V). These regions are separated from one another by the 
regions JJ and ZV, which are the so-called potential barriers aeb 
and cgd, within whose limits the particle cannot exist. At the 
. boundaries of the potential barriers (at points a, b, c and d), 
the particle reverses its direction of motion; in region J the 
particle can travel without limit to the left of boundary a of 
the barrier, and in region V, without limit to the right of bound- 
ary d of the barrier. In region III, the particle oscillates be- 
tween points b and c; it is in a so-called potential well bfe. 


3.5 Perfectly Elastic and Inelastic Collisions 


3.5.1 An impact in which a considerable change in the veloc- 
ities of the colliding bodies occurs in an extremely short time 
interval is called a collision. Examples of collisions are a ham-- 
mer striking a billet being forged that lies on the anvil, and 
a hammer striking the head of a nail being driven. 

The line of impact is the common normal to the surfaces of the 
colliding bodies at their point of contact. If the centres of mass 
(2.3.3) of the colliding bodies lie on the line of impact at the ins- 
tant of contact, the collision is said to be central. An example is 
the collision of two billiard balls. The collision, or impact, is 
said to be head-on, or straight-line, if the velocities of the centres 
of mass of the colliding bodies are parallel, before the collision, 
to the line of impact. Otherwise, it is called an oblique impact, 
or glancing collision. 

3.5.2 Ina collision the bodies are deformed and instantaneous, 
but quite substantial, forces are developed. They are called 
impact forces. For a system of colliding bodies these forces are 
internal ones*, i.e. they do not change the total momentum of 
the system. External forces, constantly exerted on the system 


* It is assumed that the colliding bodies are either free (2.2.3) 
or the constraints imposed on them are such that no impact 
constraint reactions are developed. 
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(for instance, the gravity forces of the bodies) are usually very 
small compared to the impact forces. Therefore, even though 
the impulses of impact forces (2.4.2) in the duration t of the 
collision are commensurable with the momenta (2.3.4) of the 
colliding bodies, the resultant impulse of all the constantly 
acting external forces during the same time interval t is small 
compared to the momenta of the bodies. Accordingly, the work 
done on the system by the external forces during the time inter- 
val t is small compared to the mechanical energy of the system. 
ITence, a system of bodies can be assumed, in the collision pro- 
cess, to be an approximately closed system (2.2.4). This means 
that in calculating the results of the collision, use can be made 
of the laws of the conservation of momentum (2.7.1), of angular 
momentum (4.4.1) and of energy (9.2.7). If the bodies deform 
in the collision like perfectly elastic ones, the impact forces 
are conservative and the law of conservation of mechanical 
energy (3.4.2) is valid in the system. 

3.5.3 The collision of two bodies is said to be perfectly inelastic 
if, after impact, they travel together as a single body. Suffi- 
ciently close to a perfectly inelastic collision are, for example, 
the impact of the pile-driving hammer with the end of the pile 
being driven, and the firing of a bullet into a wagon filled with 
sand, when the bullet remains embedded in the sand. Various 
processes occur in the colliding bodies in an inelastic collision 
(plastic deformation of the bodies, friction, etc.). As a result, 
the kinetic energy of the system is partly transformed into its 
internal energy (9.4.2). 

If two bodies of masses m; and m, travelling with translational 
motion at the velocities v, and v,, undergo a perfectly inelastic 
head-on central collision, they continue their translational motion 
together at the veloeity 


— M4Vi t My2 
mitm, ' 


Note. In the case of a perfectly inelastic collision that is neither 
head-on nor central, the preceding equation can be used to find 
the velocity of the centre of the mass of the bodies joined to- 
gether in the collision. As the result of such a collision, how- 
ever, the system may begin to rotate about its centre of mass in 
accordance with the law of conservation of angular momen- 
tum (4.4.1). 


Rk 
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3.5.4 The change in the kinetic energy of a system of two collid- 
ing bodies in a perfectly inelastic head-on central collision is 


ABE AEM p y2_ 5 V3 
Mm (v;— V)? < 0. 


2 (my Fma) 


In particular, if the second, body is at rest before the collision 
(for instance, a pile being driven by a pile driver, or a forging 
lying on the anvil), the relative reduction in the kinetic energy 
of the system in a perfectly inelastic head-on central collision is 


AE, po! Mo 
Ex, mı tm ` 


In engineering, a perfectly inelastic head-on central collision 
(impact) is made use of to change the shape of bodies (smith 
and die forging, rivetting, etc.) or to move a body in a medium 
with high resistance to motion (driving nails, piles, etc.). It 
proves expedient in the first case to have the ratio —AEx/Ex, 
as near as possible to unity, i.e. it is necessary that m `> m, 
(the mass of the forged item and the anvil should be many times 
greater than that of the hammer). In the second case, on the 
contrary, it is necessary that the loss in kinetic energy in the 
collision (impact) be as low as possible, i.e. that m, > m, 
(the mass of the hammer should be many times greater than 
that of the nail being driven). 

3.5.5. A collision is said to be perfectly elastic if it does not lead 
to any change in the mechanical energy of the system, i.e. if 
the bodies are perfectly elastic. 

Example 1. A perfectly elastic head-on central collision of two 
bodies (for instance, balls) of masses m, and m, which travel 
before the collision with translational motion at the velocities 
vı and v, along axis OX, passing through their centres of mass 
(Fig. 3.6a). The velocities of the bodies after the collision, 
u, and u, (Fig. 3.6b), can be found from the laws of conserva- 
tion of momentum and mechanical energy: 


MyUy + MU = MV1 F MeVg, mu + mau = mv? + mv}. 
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Velocities u, and u are directed along axis OX and their 
projections on this axis are 


(my— Mg) Vy x +2mMgqVox 


u == 

1x mı +m 

and diraz 2MV1x + (Ma — M1) Vox 
a | mı + mg ° 


In particular, if the masses of the bodies are the same, the 
bodies exchange velocities in a collision: uzy = ve, and uzy = 


= Viy 
If the mass of the second body is many times greater than that 
of the first body, then uiy X 2v,, — vy, and Ugy Va. 


(2) Before collision (b) After collision 


Fig. 3.6 


3.5.6 Example 2. Perfectly elastic oblique central collision. 
If the bodies are smooth the impulse of friction forces in the 
collision can be neglected. In this case the tangential compo- 
nents of the velocities of the bodies are not changed, i.e. compo- 
nents perpendicular to the line of impact: uit = vyt and ust = 
= ver. The normal components, directed along the line of 
impact, change in the same way as in a head-on collision: 


(Mı — Mg) Vin +2MgVon 


u = 
ai mı + mg 
and __  2MWiın + (M — mM) Von 
an Uon = Pasar eg tampa Tee 
1 


In particular, in the perfectly elastic oblique collision of a 
smooth ball with a fixed flat wall (m, >> m, and ug = vg = 0):. 
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i.e. the ball rebounds from the wall according to the law of mir- 
ror reflection: the angle of reflection is equal to the angle of 
incidence. The magnitude of the velocity is conserved: u; = 1. 
The vector Ap, of the change in momentum of the ball in the 
collision is directed perpendicular to the wall: 


Ap; = m (uy — Vy) = —2m,V,n. 


The impulse of impact force acting on the wall is equal to 2m,v,p. 


CHAPTER 4 DYNAMICS OF ROTARY MOTION 


4.1 Moment of Force and Angular Momentum 


4.1.4. The moment of force, or force moment is a concept 
introduced in mechanics to characterize the external mechanical 
action exerted on a body that changes its rotary motion. Dis- 
tinction is made between the moment of force about a fixed 
point and about a fixed axis. 

The moment of the force F about a fixed point O (the pole) is the 
vector quantity M, equal to the vector product of the radius 
vector r, from point O to point A 
of force application (Fig. 4.1), by 


the force vector F: 
M = [rF]. 


The magnitude of the moment of 
force is M = Fr sin a= FI, 
where œ is the angle between 
Fig. 4.1 vectors r and F, and l =r sina 
is the length of the perpendic- 
ular OB (Fig. 4.14) dropped from 
point O to the line of action of the force. The quantity Z is 
called the arm of the force with respect to point O. If the point 
of application of force F is displaced along its line of action, 
the moment M of this force about the same fixed point O is not 
changed. If the line of action of the force passes through point O, 
the moment of the force about point O equals zero. 
4.1.2 The principal, or resultant, moment of a system of forces 
about fixed point O (pole) 1s the veetor M, equal to the geometric 
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sum of the moments about point O of all n forces of the system: 


M = >) [mF;], 


i=1 


where r; is the radius vector from point O to the point of appli- 

cation of force F;. 

It follows from Newton’s third law (2.5.1) that the moments 

of the internal forces of interaction between the particles of 

n system about pole O are com- l 

pensated for (cancelled out) pair- Z 

wise: [r;F;}] = — [rpFp;]. Con- 

sequently, in calculating the 

resultant moment of forces, on- 

lv the external forces exerted on 

the mechanical system being 

considered need be taken into 0, 

account. 

4.1.3 The moment of force F 

about a fixed axis a is the scalar 

quantity M, equal to the pro- la 

jection on this axis of vector M, f 

which is the moment of forceF Fig. 4.2 

about arbitrary point O of axisa. 

The value of moment M, is independent of the choice of the 

position of point O along axis a. 

Note. Sometimes the moment of a force about fixed axis a 

is understood to mean the vector quantity M, = Maia, where 

i, is the unit vector of axis a. The vector M, is the compo- 

nent of vector M of the moment of force about pole O, this 

component being directed along axis a. 

Tf the line of action of the force intersects or is parallel to axis a, 

the moment of force about this axis is equal to zero. 

Let A be the point of application of force F, and O, be the foot 

of the perpendicular dropped from point A to the axis OZ being 

considered (Fig. 4.2). It proves convenient to resolve force F 

into three mutually perpendicular components: axial compo- 

nent F,, parallel to axis OZ;iradial component Fn, directed along 
2 

vector ọ = O,;A, and tangential component F,, directed per- 

pendicular to the axis and perpendicular to vector p. The mo- 
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ment of force F about axis OZ is 
M, = [pF]; and M, = [oF]. 
Since vectors ọ and F, are perpendicular to each other 


| M,|=|M,|=oe|Frl. 


The principal, or resultant, moment of a system of forces about 
a fixed axis a is equal to the algebraic sum of the moments of all 
the forces of the system about this axis. 

4.1.4 The angular momentum, or moment of momentum, of a 
particle about fixed point O (pole) is the vector L. It is equal 
to the vector product of radius vector r, from point O to the 
location of the particle, by vector p of its momentum: 


L = [rp] = [rmy], 


where m and v are the mass and velocity of the particle. 

The angular momentum of a system about fized point O is denoted 
by vector L and equals the vector sum of the angular momenta 
of all the particles of the system about the same point 0: 


n 


n 
L= 3 [r:p:] = > [rimivi), 
i=1 


i= 


where m;, r; and v; are the mass, radius vector and velocity 
of the ith particle, and n is the total number of particles. 

The angular momentum of a system about fixed axis a is the sealar 
quantity La, equal to the projection on axis a of vector L, the 
angular momentum of the system about some point O lying 
on this axis: 


n 


= 
La = >; [rimiVi]a - 
i=={ 


The numerical value Z, of the angular momentum is indepen- 
dent of the choice of the position of point O along axis a. | 
Note. Sometimes the angular momentum of a system about 
a fixed axis a is understood to mean the vector quantity L, =. 
= Laia, where i, is the unit vector of axis a. 
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4.1.5 The angular momentum of a body rotating about a fixed 
point O at the angular velocity is equal to: 


L= [rv] dm = [r [@r]] dm, 
i l 


where r is the radius vector from point O to an infinitely small 
clement of the body of mass dm, and v = [ar] is the velocity 
of the element of the body. Since [r [or]] = r?@ — (or) r, 
vectors L and @ do not coincide in direction in the general 
case; 


L=o \ r? dm — f (or) rdm. 
(m) (m) 


The angular momentum of a body hinged at point O and its 
angular velocity coincide in direction if the body rotates about 
one of its principal axes of inertia (4.2.4) passing through 
point O i 


L = Io, 


where I is the moment of inertia (4.2.1) of the body about this 
principal axis. 

4.1.6 The resultant moments M and M* of a system of forces 
about two different fixed points O and O* are related by the 
equation 


M = M* + [r*F], 


where r* is the radius vector from origin O to point O*, and F 
is the principal vector of the system of forces being considered: 
If F = 0 the resultant moment of the system of forces is the 
same about any fixed point: M* = M. This, precisely, is the 
property possessed by a force couple, or simply couple, i.e. 
a system of two forces, equal in magnitude and directed along 
two parallel straight lines in opposite directions. The shortest 
(perpendicular) distance d between the lines of action of the 
forces is called the arm of the couple. The moment, or torque, 
of a force couple is directed perpendicular to the plane of the 
forces, and its magnitude is equal to M = Fd, where F is the 
magnitude of each of the forces of the couple. 
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The resultant moment Mç of all the forces acting on a mechan- 
ical system about the centre of mass (2.3.3) of this system is 
related to the resultant moment M of the same system of ferces 
about fixed point O by the equation 


M = Mc A [rcF], 


where rç is a radius vector from origin O to point C, and F is 
the principal vector of the system of forces. 

4.1.7 The value of the angular momentum of a mechanical 
system about its centre of mass C is the same for the absolute 
motion of the particles at the velocities v; (i.e. with respect 
to a fixed inertial frame of reference) and for the relative motion 


at the velocities vi = v; — Vc (i.e. with respect to a frame of 
reference in translational motion with the origin of coordinates 
at point C): 


n n 
q 
> [eymivi] = $) (rimivi]=Lc, 
i=! izi 


where ri = r; — rç is the radius vector of the ith particle in 
a frame of reference travelling together with the centre of 
mass. 

The relation between the values of the angular momentum L 
of a mechanical system about fixed point O and Le about the 
system’s centre of mass is of the form 


L = Lo ar [rcp], 
n 
where p= > m;v; is the momentum of the system in its 


i=1 
absolute motion. 


4.2 Moment of Inertia 


4.2.4 The moment of inertia of a mechanical system about fixed 
axis a is the physical quantity denoted by J, and equal to the 
sum of the products of the masses of all n particles of the system 
by the squares of their normal distances from the axis: 


n 
Iqg= > miPj, 
ii 
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where m; and p; are the mass of the ith particle and its normal 
distance from the axis. 
The moment of inertia of a body about fixed axis a is 


i= 0? dm = \ 02D dV, 
(m) (V) 


where dm = D dV is the mass of an infinitely small element of 
volume dV of the body, D is the density and p is the normal 
distance from the element dV to axis a. 

If the pony is homogeneous, i.e. its density is the same through- 
out, then 


Ia =D f 02 dV. 
(V) 


The moment of inertia J, is a measure of inertness of a body in 
its rotary motion about fixed axis a (4.3.4), in the same way 
that its mass is a measure of inertness of the body in its transla- 
tional motion. 

4.2.2 The moment of inertia of a given body about some axis 
depends, not only on the mass, shape and size of the body, but 
also on the position of the body with respect to this axis. Ac- 
cording to Steiner’s theorem of parallel axes, the moment of 
inertia J of a body about an arbitrary axis is equal to the sum 
of its moment of inertia Iç about a parallel axis through the 
centre of mass (or of inertia) of the body plus the product of 
the mass m of the body by the square of the normal distance d 
between the two axes: 


I = Igo + md. 


e 


4.2.3 The moments of inertia of certain homogeneous regularly 
shaped bodies about certain axes are listed in Table 4.1. 
4.2.4 The products of inertia of a body about the axes ofa rec- 
tangular Cartesian coordinate system are the quantities: 


i= \ ry dm = \ xyD dV, 
(m) (V) 


TABLE 4.1 
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Body 


Hollow thin-walled cir- 
cular cylinder of radius 
R and mass m 


Solid cirenlar cylinder 
(or disk) of radius R and 
mass m 

Solid sphere of radius R 
and mass m 

Hollow thin-walled 
sphere of radius R and 
mass m 


Thin straight rod of 
length Z and mass m 


The same rod 


Position of axis a eRe eee 
a 

Axis of the cylinder m R? 
Axis of the cylinder m R2/2 
Axis through the centre | 2m R?/5 
of the sphere 
Axis through the centre| 2mR?/3 
of the sphere 
Axis perpendicular tothe| mJ?/12 
rod and through its 
centre along its length 
Axis perpendicular to the| ml2/3 


rod and through one end 
of it 


| Cr | az dm = \ xzD dV, 


(m) (V) 


Iy = 


(m) (V) 


= \ yz dm = f yzD dav, 


where z, y and z are the coordinates of an infinitely small 
element of the body of volume dV, density D and mass m. 
Axis OX is called the principal aris of inertia of the body if 


the products of inertia Jy, and [,, 


are simultaneously equal 


to zero. Three principai axes of inertia can be passed through 


each point of t 


e body. The axes are mutually perpendicular. 
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‘I'he moments of inertia of a body about the three principal axes 
of inertia, passed through an arbitrary point O of the body, are 
called the principal moments of inertia of the body. 

The principal axes of inertia, passing through the centre of mass 
of the body, are called the principal central axes of inertia of the 
body, and the moments of inertia about these axes are called 
the principal central moments of inertia of the body. An axis of 
symmetry of a homogeneous body is always one of its principal 
central axes of inertia. Í 


4.3 The Fundamental Law in the Dynamics 
of Rotary Motion 


4.3.1 It follows from Newton’s laws that the first derivative 
with respect to time ¢ of the angular momentum L of a mechan- 
ical system about any fixed point O is equal to the resultant 
moment Mext of all the external forces, applied to the system, 
about the same point O: 


di, <2: ext 
dt ° 


This equation expresses the law of the variation of angular mo- 
mentum of the system. It is valid, in particular, for a rigid body, 
hinged at point O and rotating about it. In this case, the equa- 
lion expresses the fundamental law of dynamics of a rigid body 
rolating about a fixed point. 
ln terms of the projections on the axes of a fixed rectangular 
(artesian coordinate system with its origin at point O, the law 
lor the variation of the angular momentum is written in the 
form 

dL _ ext dLy — ext - dL, _ ext 

q Mx j T =M} and Tt =M,. 


Wert Ly, Ly, D5, mext M&*t and M&** are the angular mo- 
menta of the system and the resultant moments of the external 
lorces about the corresponding coordinate axes. 

4.3.2 Example. The regular precession of a gyroscope due to its 
pravitational force. A gyroscope (symmetrical gyroscope) is a rigid 
hody spinning (rotating rapidly) about its axis of symmetry 
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that can change its position in space. A gyroscope has three 
degrees of freedom (1.5.6) if it is mounted at fixed point O, 
which lies on its axis and is called the centre of suspension of the 
gyroscope. If the centre of suspension coincides with the centre 
of gravity C (7.3.2) of the gyroscope, the gyroscope is said to be 
balanced: the force of gravity does not 
affect its state of rotation. Otherwise, 
it is said to be a heavy gyroscope, or 
top (Fig. 4.3). Owing to the action of 
the moment of gravitational force about 
point O 


Mext = [romg] 


a heavy gyroscope rotates about this 
point so that its axis OZ’ rotates uni- 
formly about vertical axis OZ describ- 
ing a conical surface (shown by dashed 
lines in Fig. 4.3). Such motion of the 
gyroscope is called regular precession. 
If the angular velocity of precession 
Q< œ (where œ is the spin angular velocity or, simply, 
spin velocity of the gyroscope about its axis of symmetry 
OZ’), it can be assumed with some approximation that 
the angular momentum L of the gyroscope about point O is 
directed along axis OZ’ of the gyroscope and equals 


L = Ja, 


where J is the moment of inertia of the gyroscope about axis OZ’. 
Hence 


dL oe = ro J 
-gz ~ [omg] =| To, Lmg |=I91], 


where Q = —mrcg/Iw,, is the angular velocity of precession, 
and w,, = in the case shown in Fig. 4.3. The greater the spin 


angular velocity of the gyroscope the slower its precession. ~ 
4.3.3 The kinetic energy of a rigid body rotating about a fixed 
point at the angular velocity o is 


Iw? 


Pk- , 
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where J is the moment of inertia of the body about the instan- 


Laneous axis of rotation (1.5.6). 
An element of work done in the infinitely short time interval dt 


by force F acting on the body is 
SW = Mo dt = M dg = Madq, 


where M = [rF] is the moment of force F about point O (r is 
the radius vector from point O to the point of application of 
force F), dp = œ dt and dọ = o dt are the angle of rotation 
and the vector of an element of rotation of the body in the time 
interval dt, and Mo is the moment of force F about an instan- 
taneous axis of rotation (Mo is equal to the projection of vector 
M on the direction of vector œ). 

The change in the kinetic energy of a rotating rigid body durin 

the time interval dt is equal to the work done by the externa 


forces: 
dE = M&**do, 


where Met is the resultant moment (4.1.3) of external forces 


about an instantaneous axis of rotation of the body. 
4.3.4 If a rigid body rotates about fixed axis OZ at the angular 
velocity @, its angular momentum about this axis is 


L,=IJ,0, and L,=TI,o. 
Ifere I, is the moment of inertia of the body about axis OZ 


and does not change with time (J, = const), and | œ, | = 
= œ > 0 (mz = w if vector œ and the unit vector of axis OZ 
coincide in direction, otherwise œ, = —@). 


The fundamental law in the dynamics of a rigid body rotating 
about fixed axis OZ is 

dw 1 

~~ — Mert —— Melt 
I; PT =M} or e T, M7 > 
where ¢ = dw/dt is the angular acceleration of the body. 
It is evident from the preceding equation that the moment of 
inertia of a rigid body about some fixed axis is a measure of 
inertness of the body in its rotation about the given axis: the 
greater the moment of inertia of the body, the less the angular 
acceleration it acquires under the action of the same moment of 


external forces. 
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4.3.5 The kinetic energy of a rigid body rotating about fixed 
axis OZ at the angular velocity œ is 


Ey= 1,07. 


An element of work done during the infinitely short time dt 
by force F applied to the lody is 


SW = M,o dt = M, dg, 


where M, is the moment of force F about the axis OZ of rota- 
tion (the unit vector of axis OZ coincides in direction with 
vector @). | 
The change in the kinetic energy of a rigid body during the 
time interval dt is equal to the work done by the external forces 
during this eal 


dE, = Mt do, 


where M°** is the resultant moment of external forces about 
the axis of rotation of the body. 

4.3.6 The motion of a free rigid body satisfies the following two 
differential equations: 


d 
Wt (mvo) = and di 


pert dlc — Mert, 
Here m is the mass of the body, vg is the velocity of its centre 
of mass C, Fext is the principal vector of external forces applied 


to the body (2.5.2), Mé*t is the resultant moment of external 


forces about point C (4.1.6), and Lo is the angular momentum 
about the same point C (4.1.7). 

The first equation describes translational motion of the free 
body at the velocity of its centre of mass (2.5.3). The second 
equation follows from the law of the variation of angular 
momentum (4.3.1) and describes the rotation of a rigid body 
about its centre of mass (4.5.9). 

4.3.7 The kinetic energy of a free rigid body can be deter- 
mined on the basis of Koenig’s theorem (3.2.4): 


move, I co? 


Ek = 2 Ki 2 ? 
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where Iç is the moment of inertia of the body about an instan- 
taneous axis of rotation passing through its centre of mass C, 
and w is the angular velocity of the body. In the general case, 
the instantaneous axis is displaced in the body and the moment 
of inertia Iç varies with time. The value of Iç is constant if 
the body is in plane motion (4.5.9). 

Example. The kinetic energy of a homogeneous circular cylinder 
rolling down an inclined plane without slipping. The cylinder 
has plane motion because all of its points travel in vertical 
planes that are parallel to one another. The cylinder travels 
with translational motion at the velocity vo directed along 
the inclined plane, and rotates about its axis (Zo = mR?/2, 
where m and R are the mass and radius of the cylinder) at 
the angular velocity œ. From the condition that the cylinder 
rolls without slipping it follows that the instantaneous veloci- 
ties of the points of contact between the cylinder and inclined 
plane ayia sere. i.e. œ = vo/R. Hence, the kinetic energy of 
the rolling cylinder is 


mv, Ico? 3 
am. 


ly = 


4.4 Law of Conservation of Angular Momentum 


4.4.1 The law of the conservation of angular momentum can be 
formulated as follows: the angular momentum of a closed system 
(2.2.4) about any fixed point remains constant with time, i.e. 
dL 


ar = 0 and L=const. 


Accordingly, the angular momentum of a closed system about 

its centre of mass (4.1.7) remains constant with time: 

dlc 
dt 


Like the laws of conservation of momentum and energy, the 
law of conservation of angular momentum is of significance far 
beyond the scope of classical mechanics. It belongs to the most 
fundamental of physical laws because it is associated with 
a definite symmetry property of space: its isotropy. The isotropy 


= 0 and Lce=const, 


6—0186 
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of space is manifested in the fact that the physical properties 
and laws of motion of a closed system are independent of the 
direction chosen for the coordinate axes in an inertial frame 
of reference. This means that these properties and laws do not 
change when the closed system is turned as a whole through 
any angle in space. 

According to modern concepts, not only particles and bodies 
can have an angular momentum, but fields as well. Elementary 
particles and systems built from them (for instance, atomic 
nuclei) can have an angular momentum that is not related 
to their motion in space and is called their spin (Table 43.1). 
4.4.2 With respect to systems that can be described by classical 
(Newtonian) mechanics, the law of conservation of angular 
momentum can be regarded as a consequence of Newton’s laws. 
For a closed mechanical system, the resultant moment of exter- 
nal forces about any fixed point (as well as about the centre of 
mass of the system) is identically zero: Mext = 0 (consequently, 


Mext = 0, see 4.1.6, where F = Fext = 0) and the law of 
conservation of angular momentum follows from 4.3.4: 


n 
L= > [r;m;v;]= const, 
i=1 


where m;, r; and v; are the mass, radius vector and velocity of 
the ith particle of the system, which consists of n such particles. 
Accordingly (see 4.4.7 and 2.5.3), 


nr n 
Lo= > [r,miv;] = [r;m;v;]= const, 

i=1 i=1 
where r; =r; — ro, Vi = V; — Vc, and rg and vg are the 
radius vector and velocity of the system’s centre of mass. 
4.4.3 If a system is not closed, but the external forces acting 
on it are such that their resultant moment about fixed point O 
is identically zero (Mext = 0), then, according to Newton’s 
laws (4.3.1), the angular momentum of the system about the 
same point O remains constant with time: L = const. This 
condition is practically complied with, for example, by a bal- 
anced gyroscope (4.3.2) with three degrees of freedom, if the mo- 
ment of the friction force in its suspension is sufficiently small. 
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If the stand of the gyroscope, holding its centre of suspension al 
rest, is turned in any way, the axis of the gyroscope retains Its 
orientation with respect to a fixed inertial frame of reference. * 


Usually Mext=£0 and L$ const. But if the resultant moment 
of external forces about any fixed axis passing through point O 
is identically zero, the angular momentum of the system about 
this axis remains constant with time. For instance, if M ext = 0, 
then L, = const. 

lf the system rotates about fixed axis OZ and the resultant 


moment of external forces about this axis M&** = 0, the angular 
momentum of the system about the axis of rotation remains 
constant with time: 


/,@ = const, 


where œ and J, are the angular velocity and moment of inertia 
of the system. 


If the action of internal forces, as well as external forces that 


comply with the condition M&** = 0, leads to deformation 
of the system, changing its moment of inertia J,, the angular 
velocity w correspondingly increases or decreases. 

4.4.4 Free axes of a body are ones about which a free rigid body 
(2.2.3) can rotate at constant angular velocity @ in the absence 
of all external action. Such rotation of a body is said to be free, 
or inertial, rotation. The free axes of a body coincide with its 
principal central axes of inertia (4.2.4). In the general case, 
the values J,;, J, and J, of the principal central moments of 
inertia (4.2.4) differ. The free rotation of such a body (for ex- 
ample, a homogeneous rectangular parallelepiped with edges 
of different length) is accomplished practically only about two 
lree axes, corresponding to the extreme values of the principal 
central moments of inertia—the maximum and minimum val- 
ues. The rotation of the body about its third principal central 
axis, Corresponding to the intermediate value of the moment 
of inertia of the body, proves to be unstable: even very small 


* It is assumed that vector L is directed along the axis of the 
gyroscope. Otherwise, a free Eyer has a precessional motion: 
ils axis describes a circular conical surface whose vertex is at 


the centre of suspension and whose axis is directed along the 
vector L = const. 
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external action is capable of leading to substantial deviation 
of the instantaneous axis of rotation from its initial direction 
in the body. 

If the values of two principal central moments of inertia of 
a body are equal: J; = I Æ I}, stable free rotation of such 
a body (for instance, a homogeneous circular cylinder) is possi- 
ble only about the free axis corresponding to the third, differing 
value J, of the moment of inertia. For a homogeneous circular 
cylinder such a free axis is its axis of symmetry. But if a long 
thin cylinder is rotated by means of a string secured to one end, 
stable rotation of the cylinder can be accomplished about the 
free axis corresponding to the maximum value of its moment 
of inertia. This free axis is perpendicular to the axis of symmetry 


of the cylinder. 


CHAPTER 5 FUNDAMENTALS OF THE SPECIAL 
THEORY OF RELATIVITY 


5.1 Postulates of the Special Theory of Relativity 


5.4.1 The special theory of relativity (sometimes called the 
restricted relativity theory) is the modern physical theory of space 
and time. The special relativity theory and quantum mechan- 
ics (38.1.1) serve as the theoretical basis for all modern physics 
and engineering (for example, nuclear physics and nuclear 
technology). The special relativity theory is often called relati- 
vistic theory, and specific phenomena described by this theory 
are called relativistic effects. Relativistic effects are manifested, 
as a rule, at velocities of bodies close to that of light in free 
space (c = 3 X 108 m/s) and called relativistic velocities. Relati- 
vistic mechanics refers to the mechanics of motion at relativistic 
velocities and is based on the special theory of relativity. 

As in classical Newtonian mechanics it is assumed in special 
relativity theory that time is uniform (3.4.2) and that space 
is homogeneous (2.7.1) and isotropic (4.4.1). 

5.1.2 The special theory of relativity is based on two main 
principles accepted as the initial postulates. 

The first postulate is a generalization of Galileo’s mechanical 
principle of relativity (2.8.4) to cover any physical processes. 
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This postulate, called the principle of relativity, or Einstein's 
relativity principle, states: all physical phenomena proceed in 
lhe same way, under the same conditions, in all inertial frames 
of reference (2.1.2). In other words, the principle of relativity 
contends that physical laws are independent (invariant) with 
respect to the chosen inertial frame of reference; equations 
expressing these laws have the same form in all inertial frames 
of reference. Consequently, it is impossible to ascertain, by 
means of any physical experiments whatsoever, conducted 
in a closed system of bodies (2.2.4), whether the system is at 
rest or is travelling at uniform velocity in a straight line with 
respect to some inertial frame of reference. All inertial frames 
of reference are equally justified in physics. It is impossible, 
on the basis of physical experiments, to select from the host 
of inertial reference frames any “preferred” or “absolute” frame 
of reference possessing any qualitative distinctions from other 
inertial reference frames. 

9.1.3 The second postulate expresses the principle of the invariance 
of the velocity of light: the velocity of light in free space is 
independent of the motion of the source of light. It is the same 
in all directions and in all inertial frames of reference, being 
one of the most basic physical constants. Experiments show 
that c, the velocity of light in free space, is the limiting velocity 
in nature. The velocity of any particles and bodies, as well as 
the velocity of propagation of any interactions or signals, can- 
not exceed c. 

These specific laws governing the process of light propagation 
in free space enable this real physical process to be made use of 
in establishing a procedure for the chronometry of a frame of 
reference, i.e. for the synchronization of clocks located at various 
points in space and travelling together with the frame of reference 
heing considered (5.2.3). 

».1.4 The postulates of the special theory of relativity contra- 
dict the concepts of the properties of space and time that are 
accepted in classical mechanics and on which the Galilean 
lransformations are based (2.8.1). In particular, this refers 
lo the contention, considered to be “self-evident” in Newtonian 
mechanics, that the passage of time is identical in all inertia! 
lrames of reference. From this it follows that the time interval] 
between any two events is of an absolute nature. For instance, 
i! two events occur simultaneously according to the clock in 
one inertial frame of reference, then, according to classical 
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concepts, they also occur simultaneously according to the clock 
of any other inertial frame. 

The above-mentioned contradiction can be illustrated by the 
following example (Fig. 5.4). We have two inertial frames of 
reference: fixed frame K and frame K’ which travels along the 
OX axis at constant velocity V. Assume that a flash of light is 
produced at point O at the instant of the zero time reference 
in both frames K and K’ (t = t’ = 0), when their origins O 


and O’ of coordinates coincide. By the instant of time ¢ > 0, 
light, propagating in free space at the velocity c, reaches 
points at the surface of a hollow sphere in the frame K. This 
sphere has its centre at point O and its radius equals ct. 

It can be assumed that in frame K’ the light flash occurred at 
the instant of time ¢’ = 0 at point O’. Hence, according to the 
postulates of the special theory of relativity, by the instant 
of time ¢’ = t, light in frame K’ reaches points at the surface 
of a hollow sphere of the same radius ct as in the frame K but 
with its centre at point O’. At this instant, however, point O’ 
is not at point O, but at a distance of Vt from this point. Hence, 
any attempt to combine the postulates of the special theory of 
relativity with classical conceptions of absolute time that elapses 
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identically in all frames of reference leads to an absurdity: 
the light flash must simultaneously reach points in space that 
belong to two different hollow spheres. 


5.2 Simultaneity of Events. Synchronization of Clocks 


5.2.1 The concept of the simultaneity of two or several events 
is extensively employed in making various physical measure- 
ments. For example, to determine the length J of a rod, arranged 
along the axis OX of reference frame K and travelling with 
respect to this frame, it is necessary to note the values of the 
coordinates x, (t) and zx, (é) of the ends of the rod simultaneously, 
i.e. at the same instant of time t. Then 


l = | za (t) — zı (t) |. 


The instant of time at which some event occurs (for instance, 
the launching or touchdown times of a spaceship) is determined 
by noting a clock’s reading that is simultaneous with the event 
being considered. This can be simply done by means of a clock 
located at the place where the event occurs. Hence, there should 
be many clocks located at various points in space in each frame 
of reference. Obviously, these clocks should be in agreement 
with one another, i.e. run synchronously: their readings should 
be identical at each instant of time ż. 

5.2.2 The synchronism of clocks located adjacently, i.e. at 
the same place in space, can be checked by the coincidence of 
their readings at ach arbitrary instant of time. The synchronism 
of' clocks that are separated in space at the points A and B, 
distant from each other, could be checked in a similar manner 
if we had at our disposal standard time signals that could pro- 
pagate instantaneously from A to B. Experiments indicate, 
however, that this method is unfeasible because the velocity 
of any signal cannot exceed the velocity of light in free space. 

We can, however, proceed in the following manner. We can 
convey the clock from B to A, check whether it is synchronous 
with the clock located at A, and then convey the clock carefully 
back to B. Then we can check whether the clock brought back 
to B runs at the same rate as the clock left at A by transmitting 
lime signals from A to B after definite equal time intervals 
according to the clock at A. But it cannot be established by 
(his procedure whether a shift in the zero time reference has 
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occurred in the clock reading while it was being conveyed back 
to B, i.e. whether the clock brought back to B is now fast or 
slow with respect to the clock at A by a constant amount At. 
5.2.3 The problem of the synchronism of clocks separated in 
space at points A and B can be solved only by an unambiguous 
agreement (or definition) about when these clocks are to be 
regarded as synchronous. Einstein made use of a real physical 
process—the propagation of light in free space—as the basis 
for such a definition. He proceeded from the facts that the 
velocity of light in free space is the maximum possible velocity 
at which signals can be transmitted in nature and that it is 
identical in all directions and in all inertial frames of :ref- 
erence. 

Assume that according to the clock at point A the light signal 
is transmitted from this point at the instant of time ż and, 
after being reflected at point B, returns to A at the instant of 
time ¢,. Then, by definition, the clock at B is in synchronism 
with that at A if they run at the same rate and if, at the instant 
the light signal reaches B, the clock located there indicates the 
time to = (t + ts)/2. 

0.2.4 According to the special theory of relativity, the passage 
of time may differ in different inertial frames of reference. 
Consequently, the time interval between any two definite 
events is relative: it can change in going over from one inertial 
reference frame to another. In particular, the simultaneity of 
two events occurring at different points separated in space is 
also relative. Events occurring simultaneously in one inertial 
frame of reference are not at all simultaneous in other inertial 
frames that are travelling with respect to the first frame. In 
some frames of reference one of these events occurs before the 
other event; in other frames it occurs after the other event. 
Thus, in the example illustrated in Fig. 5.1 (see 5.1.4), the 
arrival of the light flash at points A and B are events that 
occur simultaneously in the fixed frame of reference K. In the 
travelling frame of reference K’ these events are not simulta- 
neous. The light flash reaches A, receding from the light source 
O’, later than it reaches B, approaching the source 0’. 

Events connected by a cause-and-effect relationship, i.e. by 
causality, cannot occur simultaneously in any frame of reference 
whatsoever, because any effect is due to some process initiated 
by the cause. At the same time, no process (physical, chemical, 
biological, etc.) can proceed instantaneously. Therefore, rela- 
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tivity does not contradict causality in any way. In any iner- 
tial frame of reference an event that is an effect always occurs 
later than the event causing this effect. 


5.3 Lorentz’s Transformations 


5.3.1 It follows from the postulates of the special theory of 
relativity, as well as from the homogeneity (2.7.1) and isotropy 
(4.4.1) of space and the uniformity of time (3.4.2), that the rela- 
tions between the coordinates and times of the same event in 
two inertial frames of reference are expressed by the Lorentz 
transformations, rather than the Galilean transformations (2.8.1) 
used in classical (Newtonian) mechanics. According to the prin- 
ciple of relativity and the above-mentioned properties of the 
symmetry of space and time, the Lorentz transformations should 
be linear. 

5.3.2 The Lorentz transformations are in their simplest form 
if the like axes of the Cartesian coordinates of the fixed (K) 
and moving (K’) inertial frames of reference are pairwise paral- 
lel to each other and if frame K’ travels with respect to frame K 
at the constant velocity V along axis OX (Fig. 2.3, see 2.8.1). 
If, in addition, the zero time reference in both frames (t = 0 
and t’ = 0) is taken as the instant when the origins O and O’ 
of coordinates of the two systems coincide, then the Lorentz 
transformations are of the form 


x—Vt ` oo) EVE 
"=Y VA 
y =y, y=y', 
Zz =2, z=2', 

,_ t—Va/c? _ + V2'[e? 

© VIV’? VISTE ’ 


where c is the velocity of light in free space. 

5.3.3 The Lorentz transformations show that not only the spatial 
coordinates of an event being considered are changed in going 
over from one inertial frame of reference to another, but the 
corresponding instants of time as well. But a definite interrela- 
lion exists between the spatial coordinates x’, y’ and z’ of 
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an event and the time instant t’ it occurs in an arbitrary inertial 
frame of reference K’. Hence, the quantity [(x’)? + (y’)? + 
-+ (z’)2 — c? (t')?] is independent of the velocity V of frame K’, 
i.e. it is identical in all inertial frames of reference: 


(x°)? + (y)? + (2’)® — 2 (t)? = 1? + y? + z2 — ee. 


The coordinate x’ and time ?¢’ cannot be imaginary. It follows, 
therefore, from the Lorentz transformations that the velocity 
of relative motion of any two inertial frames of reference cannot 
exceed the velocity of light in free space (V <c). 

5.3.4 According to Einstein’s principle of relativity (5.1.2), 
physical laws must comply with the condition of relativistic 
invariance (Lorentz invariance). This requirement implies that 
equations expressing physical laws retain their form in going 
over from one inertial frame of reference to another in accord- 
ance with the Lorentz transformations. 

The Lorentz transformations reduce to the Galilean transforma- 
tions (2.8.1) at V < c or, more precisely, in the limit as V/c > 
— 0, i.e. as c — oo. In other words, the Galilean transforma- 
tions and classical (Newtonian) mechanics based on them are 
founded on the assumption of instantaneous propagation of 
interaction. Such an approximate approach is permissible only 
in considering the laws of mechanical motion of bodies at veloci- 
ties a great many times less than the velocity of light in free 
space. 


5.4 Relativity of Lengths and Time Intervals. 
Time Interval Between Two Events 


5.4.1 It follows from the Lorentz transformations (5.3.2) that 
the linear dimension of a body travelling with respect to7an 
inertial frame of reference is reduced in the direction of motion: 
This change in the longitudinal size of a body in motion‘is called 
the Lorentz-FitzGerald contraction. Assume l to be the length 
of a rod at rest in reference frame K’. If the rod is located along 
axis O'X’ (Fig. 5.2), then lọ = z — zi, where x, and x} are 
the coordinates of the ends of the rod. The length J of the same 
rod in reference frame K, with respect to which it travels along 
axis OX at the velocity V, is equal to the difference in the coor- 
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dinates of the ends of the rod, measured at the same instant of 
time t: 


l= z (t)— z (t) = (25— x1) V 1—V2/e = la V 1— V2/c2. 


Transverse dimensions of a body are independent of its velocity 
and are identical in all inertial frames of reference: 


as , t EEEE { f 
Yo — Yı = Yo — y; and Za — 21 = 25 — Zi. 


Thus, the linear dimensions of a body are relative. They are 
maximal in the inertial frame of reference with respect to which 
the body is at rest. These are | 

said to be the body's proper yyy AYK’ 

dimensions. 


5.4.2 The Lorentz-FitzGerald 


contraction is a kinematic effect — A 
of the special theory of relativ- 
ity. It is not due to the effect of zy xw X 


any kind of longitudinal forces 
compressing the body along its O O xlt) ælt) X 
direction of motion. This con- 

traction becomes appreciable Fig, 5.2 

only at velocities approaching 

that of light in free space. It 

follows from the contraction formula that bodies cannot travel 
at velocities V >c because at V = c the longitudinal dimen- 
sion of the body (in the direction of motion) becomes zero, and 
at V->c it would have to become imaginary. 

5.4.3 It is evident from the Lorentz transformations that in 
the theory of relativity we can specify a definite “instant of 
time” only with respect to a single definite inertial frame of 
reference. Thus, for instance, a single “instant of time” in ref- 
erence frame K (a single definite value of time ¢ in this frame) 
corresponds to many values of time ?’ in frame K’, depending 
upon the values of coordinate z: 


a t—Vzx/c? 
C= —. ° 
V 1—V2/c2 


Likewise, a single “instant of time” in frame K’, i.e. a single 
definite value of time ¢’, corresponds to many values cf time / 
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in frame K, depending upon the values of coordinate z’: 
_ PJV? 
V1—V?/e2 ` 


5.4.4 Another important consequence of the Lorentz transfor- 
mations is the relativity of the time interval between any two 
events (for example, the beginning and end of some process), 
i.e. the dependence of this time interval on the choice of the 
inertial frame of reference. Assume that two events, 1 and 2, 
occur in the moving inertial frame of reference K’ at the same 
point A, fixed in frame K’ (i.e. r = x;), at the instants of time 
t; and t,. Thus, the time interval between the events is tT) = 
= Ía — th. 

With respect to fixed inertial frame of reference K, point A 
travels with the same velocity V as does frame K’. Hence in 
frame K events 1 and 2 occur at different points in space, having 
the coordinates xı and z>, and za — x; = Vt, where t = ft, — ty 
is the time interval between events 1 and 2 according to the 
clock in frame K. It follows from the Lorentz transformations 


that 


t 


e ae EEE a E S 
2 1 YI Ve VW 1—V3/e À 


Thus, the time interval between two events is minimal in the 
inertial frame of reference with respect to which both events 
occur at the same point. The time measured by a clock travel- 
ling together with some object is said to be the proper time 
of the object. 

5.4.5 The consequence of the Lorentz transformations that is 
dealt with in the preceding subsection is evidence of the existence 
of the relativistic: effect of the slowing of time, called time 
dilation, in a moving inertial frame of reference as compared 
to a fixed one. A clock travelling at the velocity V with respect. 


to a given inertial frame runs slower by a factor of V 1 — V2/c2 
than a fixed clock. Correspondingly, in agreement with the 
relativity principle, all physical processes in a moving frame 
of reference proceed at a slower rate than in a fixed frame. 

The time dilation effect becomes appreciable only at extremely 
high velocities V, close to that of light in free space. This effect 
has been demonstrated, for example, by experiments conducted 
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with muons (43.3.3). The muon is an unstable elementary parti- 
cle. The mean proper lifetime of muons (according to a clock in 
the inertial frame of reference with respect to which they are 
at rest) Tọ = 2.2 X 10-§ s. Muons are created in the upper 
layers of the atmosphere by the action of primary cosmic rays, 
and they travel with respect to the earth at velocities V close 
to c. If there was no relativistic effect of time dilation, a muon 
would be capable of travelling an average distance, with respect 
to an observer on the earth, of toc = 660 m through the at- 
mosphere. In other words, muons could not reach the earth’s 
surface. Actually, they are registered by instruments installed 
at the surface of the earth, because the mean lifetime of a trav- 
elling muon, according to the clock of the observer on the earth, 
T= n/V 1 — Ve > Tọ, and the path travelled by a muon 
during this time, tV >> 660 m. 

5.4.6 The relativistic effect of time dilation in a spaceship 
travelling with respect to the earth makes it feasible to under- 
take space flights of almost unlimited length and journeys 
“into the future”. According to the relativity principle, all 
processes in a spaceship, including the aging process of the 
astronauts, proceed in agreement with the same laws as on the 
earth. But in the spaceship time must be measured by a clock 
travelling together with it at the velocity V with respect to the 
earth. If V is close to c the clock in the ship runs considerably 
slower than earth clocks (at the space-launch complex), by a 


factor of 1/V 4:-— V2/c®. For example, at B = V/c = 0.99999 
the rates of the spaceship and earth clocks differ by a factor 
of 224. Consequently, in a time interval of tọ = 10 years in 
such a spaceship, one can make a space flight taking t = 2240 
years according to earth clocks, aging only 10 years during 
the flight. In this flight the spaceship can travel a tremendous 
distance from the earth: 1 = Vt = Pct = 2239.98 light years*. 
The closer V is to c, the greater the distance | travelled by the 
spaceship with respect to the earth in the same interval t, 
of prover time in the spaceship, i.e. the longer the space flight 
that the astronauts can make during their lifetime. 

If an astronaut viel abies a space flight at a velocity V close 
to c by returning to the earth, he will find that people remaining 


* A light year is the distance travelled by light in free space in 
one year: 1 light year = 9.4605 Xx 1015 m. 
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on the earth (including his twin brother) has aged more than 
he has during the’ flight. If V differs from c by a sufficiently 
small amount, when (1 — V2/c?)-1/2 >> 1, the astronaut can, 
during the flight, outlive all his contemporaries and find him- 
self, upon his return, among future generations. 

5.4.7 It would seem, on the face of it, that, on the basis of the 
relativity principle, we should arrive at a conclusion exactly 
the opposite of that given in the preceding subsection. The clock 
on the earth, travelling at the velocity —V with respect to the 
spaceship, should run slower than the clock in the ship. There- 
fore, the time required for the flight should be longer for the 
astronaut, and not for the people left on the earth. Hence, the 
twin that made the flight should age more than his brother. 

We have thus found that the difference between the readings 
of the clocks at the space-launch complex and in the ship is 
positive according to 5.4.6, while, on the other hand, it is 
negative. This absurd result is called the clock, or twin, paradoz. 
Actually, no clock paradox exists. It arose due to improper 
application of the relativity principle. This principle contends 
that not all frames of reference are equally justified, but only 
all inertial frames of reference. But the frame of reference of 
the spaceship, in contrast to that of the earth, is not an inertial 
one all throughout the flight. During launching, turning around 
and in braking orbit when approaching the earth, the spaceship 
travels with acceleration (or decelaration). Consequently, the 
problem concerning the running of the clock at the launching 
complex which is always at rest with respect to a single inertial 
frame, and that of the clock in the spaceship, is, in principle, 
not a symmetrical one. Hence, the earth and ship frames of 
reference are not equally justified in the given problem. The 
reasoning in Sect. 5.4.6 is the correct one because it is based 
on the use of an inertial (the earth) frame of reference. Accord- 
ingly, the reasoning given in Sect. 5.4.7, leading to the clock 
paradox, is incorrect. In the second case, the general theory of 
relativity should be applied, rather than the special theory. 
When this is done, it is found that, from the point of view of 
the astronaut as well, his clock runs slower than the clock at 
the launching complex. 

9.4.8 The interval, or space-time interval, between two events, 
measured in the inertial frame of reference K’, is the quantity 


Sig = y (ti)? — (lio)?, 


Hg w 
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where ti, = t, — t; is the time interval between the two events 
being considered (according to the clock in frame K’), and 
lia = V (x3 — z1)? + (y3 — yi)? + (25 — 21)? is the distance 
between the points at which events 1 and 2 occur, also measured 
in frame of reference K’. 

[t follows from the Lorentz transformations that the interval 
between two given events, 1 and 2, is invariant with respect 
to the choice of the inertial frame of reference, i.e. it does not 
change in transferring from moving inertial frame K’ to fixed 
frame K: 


’ e 
Sig = S12 = inv, where s= V 2t, — 1. 


If s2, > 0, i.e. if sı is a real number, the interval sı is said 
to be timelike. 

The interval sı is said to be spacelike if s?, < 0, i.e. sy. is an 
imaginary number. 

5.4.9 It follows from the invariance of the interval with respect 
to the choice of the inertial frame K’ that in all frames K’ the 
values of #;, and lia for the giv- 
cn two events, 1 and 2, satis- 
ly the equation of a hyperbola: 


5 (iia)? = (Li)? = sto. 


If s2,>>0° the relation between 
‘ig and Jj, in various inertial 
lrames of reference K’, travel- 
ling with respect to fixed frame K 
at any possible velocities (0 < 
"i V <c), can be represented 
raphically as the two branches 
/ and II of a hyperbola (Fig.5.3). 
Consequently, the sign of the 
lime interval between events 1 Fie. 5.3 

and 2, related by a timelike in- rae ws 

terval, is absolute. It is indepen- 

(lent of the choice of the frame of reference: in all frames K’ the 
second event always occurs either after the first, i.e. t/, > 0 
(branch J) or always before the first, i.e. tj, < 0 (branch IJ). 
The distance li, is relative; an inertial frame K’ can be chosen 
in which lj, = 0, i.e. events 1 and 2 occur at the same place 
(points A and B on the branches J and JI of the hyperbola). 
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Two events related by causality should always correspond to 
a timelike interval or, in any case, one equal to zero (si = 0). 
This is due to the fact that the signal by means of which event 1 
(the cause) leads to event 2 (the effect) cannot be propagated 
in space at a velocity exceeding that of light in free space: 
lie Sc (tg — t). 

5.4.10 In the case of events related by a spacelike interval 
(s2, < 0), the sign of t;, is relative: tj, >> 0 (upper part of hyper- 
bola JII in Fig. 5.3) in certain inertial frames K’, whereas 
in others ¢}, < 0 (lower part of hyperbola JII). Point C corres- 
ponds to the frame K’ in which ti, = 0, i.e. events 1 and 2 occur 
simultaneously. 


5.5 Transformation of Velocities and Accelerations 
in Relativistic Kinematics 


5.5.1 The velocities v and v’ of a particle in two inertial frames 
of reference K and K’ are 
dr ; 


e ° dr’ i iy P 35 fs) 
v=- Pai yj rok and v = Gyr = vail + vy j’ +vz'k 


where r = zi + yj + zk and r = 2'i’ + y’j’ + 2’k’ are the 
radius vectors of the particle in frames K and K’. The projections 
of velocities v and v’ on the axes of Cartesian coordinates of 
frames K and K’ are 


dz dy dz >... bx’ , __ ay’ 


vy =— vy =—, v= —; v= ,=— and 
a WT T’ So a | 
sda’ 
at’ 


If the like axes of the Cartesian coordinates of frames K’ and 
K are pairwise parallel, and frame K’ travels with respect to 
frame K at the constant velocity V along axis OX (Fig. 2.3, 
see 2.8.1) and, in addition, if at the zero time references in 
frames K and K’ (t = 0 and #’ = 0), the origins O and O’ of 
coordinates of these frames coincide, the Lorentz transforma- 
tions in the form given in 5.3.2 are valid. It follows from these 
transformations that the relations between the projections of 
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the velocities of the particle on the axes of the Cartesian coordi- 
nates of frames K and K’ are of the form: 

net vy — V ron ve’ +V 

” 1 Vosle? * ga pyle et! 


, _vy V 1—V3/e? ; _ vy ViVe 


, 


v 


y= Vogl? ? VO AEV 
wy zV iVe : vr VI—V2/c? 
PE ae Ai is aW Ae es A 


1—Voyle2 °? © = 14- Vox /e? 

‘These equations express the law of the composition of velocities 
in relativistic kinematics. In the limit as c — oo these equations 
lead to the ordinary law for the composition of velocities in 
classical mechanics (2.8.2): 


Vi = Vy — V, vy: = Vy, vz, = v, and v’ = v — V. 


5.5.2 The relations between the squares of the magnitudes of 
vectors v and v’ are 


v? = c? | j= EAO Re i) 
(1-+ Vox /c?)? 


1 —v2/c2) (L—V2/c? 
(v) =e? [1— ( on c?) k 


In particular, ‘if v’ = c, then v = c and vice versa. Hence, 
if the velocity of a particle with respect to some inertial frame 
of reference is equal to the velocity of light in free space, then 
it must have the same magnitude with respect to any other 
inertial frame regardless of the velocity of relative motion of 
these two frames. In other words, the sum of two velocities, 
of which one is equal to c, is always equal to c. This law, detect- 
ed in the motion of such elementary particles as photons (37.1.4) 
and the neutrino (Table 43.1), demonstrates the limiting nature 
of the velocity of light in free space (5.1.3). 

5.5.3 It is evident from the equations in the preceding subsec- 
tion that a particle, travelling with respect to some inertial 
frame of reference at a velocity less than c, has a velocity with 
respect to any other inertial frame that is also less than c (for 
instance, if v < c, then v < c, and vice versa). From this it 
follows, among other things, that no matter how close to c 


7—0186 


and 
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the velocities of two particles are, their relative velocity is 
always less than c. Assume, for example, that two particles are 
travelling toward each other along the axis OX of frame K at 
the respective velocities: vı = 0.8ci and v, = —0.8ci. The 
velocity u» of the second particle with respect to the first is 
not equal, as is assumed in classical mechanics, to the vector 
difference v, — vı = —1.6ci if only because the magnitude of 
this velocity exceeds c. The required velocity is equal to the 
velocity of the second particle with respect to inertial frame K’, 
travelling together with the first particle (V=0.8ci), i.e. ux, = v3. 
It follows from the equation in 5.5.1 that 


RRR fal © sel Se E aa ee ee 
Vo x ~ 1 —Vigx/e2 +0. 64 0.976c, Voy = Vo,-=0, 
i.e. Wo; = —O0.976ci’ and | ug, | <c. 


5.5.4 The projections of the acceleration of a particle on the 
axes of the Cartesian coordinates of two inertial frames of refer- 
ence K and K’ (5.5.4) are related by the following equations; 


i dvx’ g EN, 
—_——_ = a, | — ’ 


x’ dt Vox 
1— 2 
Oy y =| (Gs a) ro ox | (=) | 
c2 


, dbz! Vv Vv AVe 
S a (IEE) at Bee] re 


dV V 1—V2/c2\ 3 
a a (ener orm 
1- = 
vy Von \ + Voy or 1— V2 /c? 
ame Fl (1+ 2 ae) as | Von 3? 
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5.6 Basic Law of Relativistic Dynamics 


5.6.4 In contrast to classical mechanics, the mass of a particle 
is not constant in relativistic mechanics. It depends upon the 
velocity v of the particle. Its value m differs in two inertial 
frames of reference moving with respect to each other. The 
dependence of mass on velocity is expressed by the equation: 


Mo 


MS j 
V 1—v?2/c? 


where mọ is the rest mass of the particle, i.e. its mass measured 
in the inertial frame of reference with respect to which the 
particle is at rest, and c is the velocity of light in free space. 
The mass m is frequently called the relativistic mass. 

The effect of the velocity of a particle on its mass becomes 
appreciable only at values of v close to c. For example, m/m} = 
= 1.005 at v/c = 0.1 and m/m, = 2.29 at v/c = 0.9. It is 
evident from the law for the dependence of m on v that particles 
with a rest mass mọ =Æ 0 cannot travel at velocities greater 
(han or equal to c (i.e. v < c). At the same time, particles whose 
rest mass equals zero [photons (37.1.4) and the neutrino (Table 
43.1)| cannot have a velocity other than c. 

5.6.2 The momentum p = mv of a particle is a nonlinear func- 
tion of its velocity: 


MoV 


eee 


Vector p is sometimes called the relativistic momentum of the 
particle (to distinguish it from the value myv of its momentum 
in classical mechanics). It is obvious that at v <c, the mo- 
mentum p = mv > MoV. 

By virtue of the homogeneity of space (2.7.1), the law of conser- 
vation of relativistic momentum is valid in relativistic mechan- 
ics: the momentum of a closed system (2.2.4) does not change 
in the course of time. 

l'ollowing from this law is the law of conservation of relativistic 
mass: iN any processes that take place in a closed system, its 
total relativistic mass remains unchanged. 


1% 
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5.6.3 The basic law of relativistic dynamics is: the rate of change 
in the momentum of a particle is equal to the force F acting 
on it, i.e. 


dp _ d MoV ` 
e P or 7; (Sea) =r. 


Note. If the particle is subject to the simultaneous action of 
several forces, F is understood to be the resultant force (2.2.2). 
5.6.4 An element of work done by force F in the infinitely small 
displacement dr of its point of application is 


ôW = (E dr) = (Fv) di. 
Combining the basic law of relativistic dynamics (5.6.3) and 
the dependence of mass on the velocity (5.6.1) we obtain 


dv dm dm mv av 
Pem al dt and dt c?—v*® dt‘ 


Tlence 
ÔW = (Fv) dt = m (v dv) + dm = mv dv + vdm = c@dm. 
5.6.5 The acceleration imparted to a particle by the force F is 


dv F v dm 4 vo 
ge a aN) 


Hence, in contrast to classical mechanics, the acceleration of 
a particle in relativistic mechanics does not, in general, coincide 
in direction with the force imparting the acccleration. Vector a 
is collinear with force F in only two cases: 

(a) when force F is perpendicular to the velocity v of the particle 
(transverse force) so that (Fv) = 0 and 


F F y v2 
te m y ee a 
m Mo C 


(b) when force F is parallel to the velocity vector v of the par- 
ticle (longitudinal force) so that v (Fv) = v?F and 


2 2 \3/2 
n= (1-S)=— (1-4)" l 
Cc 
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A longitudinal force imparts to a particle an acceleration less 
by a factor of 1 — v?/c2 than that imparted by a transverse force 
of the same magnitude. This is due to the fact that a transverse 
force changes the velocity of a particle only in direction (the 
magnitude v of the velocity and the relativistic mass are not 
changed), whereas a longitudinal force changes the magnitude 
of the velocity of a particle as well as its mass. 


5.7 Mass-Energy Relation 


5.7.1 The increment of the kinetic energy E, of a particle is 
equal to the work done by the force F acting on the particle 
(5.6.4): 


dE, = 6W = c? dm, 


where dm is the corresponding increment of relativistic mass 
of the particle. It follows that 


4 
Ex = (m — c2—=m e| 1] 
k ( Mo) 0 V 1—v/c2 , 


where m, is the rest mass of the particle. Expanding (1 —v?/c?)-1/2 
into a Maclaurin series we obtain 


E k = mgc? |- (>il). 


C 


At v & c this equation reduces to the ordinary expression for 
the kinetic energy in classical mechanics: 
O mov? mv? 


DES as 
ee) 2 


5.7.2 It follows from the first equation in the preceding sub- 
section that an increase in the kinetic energy of a body should 
be accompanied by a corresponding increase in its relativistic 
mass m: 


4 
dm =~ dE. 


Changes in other kinds of energy of a body are also | associated 
with an increase in its relativistic mass. If, for example, in 
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heating a body at rest its internal energy (9.1.2) is increased 
by the amount dU, the mass m of the body, equal to its rest 
mass my, is increased by 


1 
dm =dm}=-7 dU. 


In the general case, a change in the total energy E of the body 
by the amount dE is accompanied by a change in its relativistic 
mass by the amount 


dm ee dE. 
c2 


Accordingly, the following universal relation exists between E 
and m: 


Mc? , 
V T= 
this is known as the mass-energy relation: the total energy of 
a body (or system) is equal to the product of the relativistic 
mass of the body (or system) by the square of the velocity of 
light in free space. 
By virtue of the uniformity of time (3.4.2), the law of the con- 
servation of energy is valid in relativistic mechanics as it is in 
classical mechanics: the total energy of a closed system (2.2.4) 
does not change in the course of time. 
It follows from the mass-energy relation that the laws of the 
conservation of relativistic mass (5.6.2) and of the total energy 
are not independent laws. 
5.7.3 The total energy of a particle or a system of particles (lor 
instance, of an atomic nucleus, atom, molecule or body) at 
rest is equal to 


E = mce? = 


Ep = MoCc?, 


where mọ is the rest mass; FE, is called the rest energy of the 
particle or system. The values of mọ and E, are independent 
of the choice of the inertial frame of reference. These quantities 
are constant characteristics of structureless (elementary) par- 
ticles, similar, for example, to their electric charge and spin 
(Table 43.1). The rest mass and rest energy of a system of par- 
ticles depend upon the composition of the system and its inter- 
nal state. For example, the rest mass of an excited nucleus (or 
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atom) is greater than the rest mass of the same nucleus (or atom) 
in the ground state. 
5.7.4 The total energy Æ of a particle and its “momentum p 
are related by the equations: 

2 Se 
p=— vor I -p= mgc? or E= V pre mégct , 
The values of the total energy, relativistic mass and momentum 
of a given particle, in contrast to its rest mass mg, are relative, 
i.e. they are different in two inertial frames of reference K 
(E, m and p) and K’ (E’, m and p’). But the difference between 
the square of the total energy of a particle divided by c? and 
the square of the momentum of the particle, like the interval 
between two events (5.4.8), does not depend upon the choice 
of the inertial frame of reference: 
(Ey ng EB 


ge NE ae Pen 


3.7.5 In going over from one inertial frame K to another K’ 
that is travelling with respect to the first at the velocity V = 
= const along the OX axis (Fig. 2.3, see 2.8.1), the projections 
of the momentum of the particle on the coordinate axes and 
its total energy are transformed as follows: 


VE j VE’ 

; Px p Per + 
Por =p; 2S as 
* VT Ve “VT —V2]e8 

Py = Py, Py = Py», 
Py = Pz, Pr=Py, 
t E—Vp, E'+Vp,,. 
1 = —a, E = —, 
V 1—V?/c? V 1—V3/e 


5.7.6 It does not at all follow from the laws of conservation of 
relativistic mass (5.6.2) and of total energy (5.7.2) that the rest 
mass and rest energy of a closed system cannot change. For 
instance, the sum of the rest masses of free protons and neutrons 
i always greater than the rest mass of the atomic nucleus they 
orm. 
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A quantity introduced to characterize systems held together 
with a margin of safety (for example, atomic nuclei, atoms and 
molecules) is the concept of the binding energy. The binding 
energy of a system is measured as the least amount of work that 
must be done to separate the system into its component parts 
(for instance, an atom into its nucleus and electrons). The 
binding energy of a system is 


n 
Ep = >, maic? — Moc?, 
i=! 
where M, is the rest mass of a system of n particles, and mp; 
is the rest mass of the ith particle in the free state. The quantity 
n 
Am = >») Moi == Mo == 


i=1 


Ep 
c2 


is sometimes called the mass defect of the system. 


CHAPTER 6 GRAVITATION 


6.1 Law of Universal Gravitation 


6.1.1 Isaac Newton’s law of universal gravitation states that the 
force of mutual attraction between any two particles is directly 
proportional to the product of their masses and inversely pro- 
portional to the square of the distance between them. Such 
forces are called gravitational forces. If m; and m, are the masses 
of the particles being considered and r; and r, are the radius 
vectors of these particles, then, according to the law of univer- 
sal gravitation, the force F}, attracting the first particle toward 
the second particle, and the force F, attracting the second 
particle toward the first particle are equal in magnitude and 
opposite in direction; they are equal to: 

MyM, 


Fio= —G—— l12 and Foy = —G —z ~ rr 
ris r31 
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Ilere rj. = ry — r and rg; = rg — r; are radius vectors from 
the second particle to the first and from the first to the second, 
respectively, and ris = ra = | Tig | = | ra | is the distance 
between the particles. The proportionality factor G is called the 
constant of universal gravitation, or the gravitational constant. 
In value the gravitational constant is equal to the force of 
mutual attraction of two particles offunit mass with a distance 


Fig. 6.4 


of unit length between them. It has been found from experi- 
ments that 
(| = (6.6720 + 0.0041) x 10-" (N-m?)/kg?. 


6.1.2 The gravitational interaction of two bodies of arbitrary 
size and shape (Fig. 6.1) is described by the equation 


Fis = —G \ Pı dV | Pi T12 dV, 
(V1) (Va) 


where r} is the radius vector from the infinitely small element 
dV of volume of the second body to the infinitely small el- 
ement dV, of volume of the first body, p; and p, are the densities 
of the two elements, and integration is carried out throughout 
the whole volume of both bodies. 

The calculations of force F,, are substantially simplified in 
two Cases: 

(a) when there is a spherically symmetrical distribution of the 
masses in the two bodies, i.e. both bodies are of spherical shape 
and the density of each body depends only upon the distance 
to its centre (in particular, the bodies can be homogeneous); 
(b) when one of the bodies is of negligibly small size with re- 
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spect to the other body in which the distribution of its mass is 
spherically symmetrical. 
In these cases 


Fi2=—G 


3 Fiz, 
12 


where m, and m, are the masses of the bodies, and rj, is the ra- 
dius vector connecting the centres of mass (2.3.3) of the second 
and first bodies. l 
6.1.3 As a first approximation, it can be assumed that the earth 
is of spherical shape and the distribution of its mass is spherical- 
ly symmetrical. Hence the gravitational force F, exerted by the 
earth on a body of mass m, is directed toward the centre of the 
earth and its magnitude is 


mME 
72 


F=G 


where My is the mass of the carth and r is the distance from 
the body to the earth’s centre (the’size of any body on earth 
is negligibly small compared to the radius of the earth). 

6.1.4 In application to such microscopic objects as elementary 
particles, gravitational interaction is practically of no import- 
ance whatsoever. It is found to be extremely weak in comparison 
to all other kinds of interaction: strong, electromagnetic and 
weak interaction (43.2.5 through 43.2.8). For example, the 
electric force of mutual repulsion of two electrons exceeds their 
gravitational force by over 104? times! Even for ordinary macro- 
scopic objects on the earth the forces of gravitational inter- 
action are extremely small. Two homogeneous spheres, for 
instance, with a mass of 1000 kg each and a distance of one 
metre between their centres are attracted to each other with 
a force of only 7 x 10-° N. 

But, at the same time, gravitational forces are the determining 
factors in the motion of objects that are investigated in astron- 
omy and astronautics (spaceships, planets and their satellites, 
planetary systems, stars, etc.). This is due, primarily, to the 
huge size of astronomical bodies and, in the second place, to 
the small electromagnetic forces of interaction of such bodies, 
which, as a whole, are practically electroneutral. 
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6.2 Gravitational Field 


6.2.1 Gravitational interaction between bodies is due to the 
vravitational field set up by the bodies. A distinctive feature of 
a gravitational field is that a particle placed in this field is 
subject to a force proportional to the mass of the particle. 

The force characteristic of a gravitational field is its intensity, 
the vector quantity gr, which is equal to the ratio of the force F, 
exerted on the particle by the field in which it is placed, to 
(he mass m of the particle: 


F 
Beee 

m 
The intensity of a gravitational field is independent of the mass 
m of the particle. It is a function of the coordinates (x, y, 2) 
of points of the field being considered. In the case of an unsteady 
"eld the intensity also depends upon the time ¢. 
\ gravitational field is steady, or stationary (2.2.1), if the body 
that sets it up is fixed in the frame of reference chosen to describe 
the field. The intensity of a steady gravitational field depends 
only upon the coordinates: gp = gt (z, y, 2). 
It follows from Newton’s second law (2.4.3) that the forces of 
a gravitational field impart an acceleration a to a free particle 
in the field. This acceleration equals the intensity of the field: 


it ae 
ma 21 


6.2.2 It follows from the law of universal gravitation (6.1.1) 
Ihat the intensity of the gravitational field of a fixed particle 
of mass M, located at the origin of coordinates, equals 


iy a —G T, 


r3 
where r is the radius vector of the point being considered in the 
held. 
\ gravitational field is a conservative, or potential, one (3.1.6) 
because the force acting on a particle of mass m brought into 
the field is a central force (3.3.4); 


mM r 


Fo mgro G23. 
g Ro oF 
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Correspondingly, the potential energy of a particle in such 
a field is equal to (3.3.4):* 


o0 oo 
: d 
Ep=| P, dr= —GmM | =~ mM . 
r r 


It is equally correct to regard the quantity Ep as the potential 
energy of a particle of mass M in a gravitational field set up by 
a particle of mass m or, finally, as the mutual potential energy 
of two particles due to their gravitational interaction. 

6.2.3 A gravitational field complies with the principle of super- 
position of fields: in superposing several (n) gravitational fields 
their intensities at each point in space are added vectorially, 
i.e. the intensity of the resultant field is 


n 


g= > Sfi, 


i=1 


where g;; is the intensity of only the ith field at the point being 
considered in space. | 

The intensity of the gravitational field of an arbitrary system 
consisting of n fixed particles is 


where 9; = r — r; is the radius vector from the ith particle, 
whose radius vector is equal to r;, to the point in the field’ spec- 
ified by radius vector r. Consequently, the potential energy of 
a particle of mass m in this gravitational field is 


n 
m; 
Ey= —Gm — 
Ẹ z Pi’ 
i= 


* Here and throughout Sects. 6.2 and 6.3 an infinitely distant 
point is chosen as the reference point for the potential energy, 
i.e. it is assumed that Ep (œ) = 0. 
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in particular, if the gravitational field is set up by a body whose 
mass M is distributed in a spherically symmetrical manner 
(6.1.2), then outside this body 


GmM 
r 


M 
fee — —G -7 ! and Ep= — 


? 


where r iş a radius vector from the centre of the body to the 
point being considered in the field. These equations are valid, 
for example, for the gravitational field of the earth. 

6.2.4 By virtue of the conservative nature (potentiality) of the 
gravitational field (3.1.6), its energy characteristic, the potential, 
can be introduced. The potential of a gravitational field is the 
scalar p, equal to the ratio of the potential energy Ep of a par- 
Licle, placed at the point being considered in the field to the 
mass m of the particle: | 


(p a$ e 


The potential @ is independent of the mass m of the particle; 
it is a function of the coordinates of points of the gravitational, 
eld. For example, the potential of a gravitational field set up 
hy a fixed particle of mass M is 


GM 
r 


(= — 


? 


where r is the distance from the source of the field to the point 
being considered. 
The potential of a gravitational field set up by an arbitrary 


system of n fixed a a 
n 


m; 
p= — > G — 


Oi 


i=1 


where p; is the distance from a particle of mass m; to the point 
heing considered in the field. Thus, in superposing gravitation- 
nl fields, their potentials are added together algebraically, 
i.c. the potential ọ at any point of a resultant field is equal to 
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the algebraic sum of the potentials at the same point for all 
the superposed fields taken separately; 


n 
p= >) gi. 


i=l 


Note. In applying this equation it is necessary that the same 
reference point be chosen for the potentials ọ; of all the fields 
being superposed: @; (co) = 0 (see the footnote to Sec. 6.2.2). 
6.2.5 An element of work done by the forces of a gravitational 
field in an infinitely small displacement dr of a particle of mass m 
in this field is: 


dW = (F dr) = m (gr dr). 


But, on the other hand, the work dW is equal to the decrease 
in the potential energy of the particle in the gravitational 
field: 


dW = —dEp = —m dọ. 


Consequently, the potential and intensity of a gravitational 
field are related by the equation: 


dp = —(gr dr) = — (8ty dx + gry dy + Erz d2), 


where giy, ty and gf, are the projections of vector gr on the 
axes of rectangular Cartesian coordinates. Since 


_ 99 , 2P dp 
Ma gy oe 
then 

ð TO) 
ge a a and Saar ee 
Hence 

_  ({(9P ., ô, I _ 
hei ay ita) erd o, 


i.e. the intensity of a gravitational field is numerically equal 
ane opposite in direction to the gradient of the potential of 
the field. 
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The relation between ọ and gy can also be written in the form 


d 
dip=—grdlcosa=—gy,dl or g= Se, 
where œ is the angle between vectors g; and dr, dl = | dr |, 


and gr, is the projection of vector gs onto the direction of vec- 
lor dr. Thus, the projection of the intensity vector of a gravita- 
Lional field onto some directionis numerically equal and opposite 
in sign to the change in the field potential per unit length in 
(he same direction. 

(6.2.6 The nonrelativistic theory of gravitation discussed above 
and based on Newton’s law of universal gravitation is only 
approximate. It describes with sufficient accuracy only compar- 
atively weak gravitational fields, having a potential | @ | < c?, 
where c? = 9 X 101° m?2/s? is the square of the velocity of light 
in free space. It is, in particular, applicable to the gravitational 
liclds of the earth and sun because the absolute values of the 
potentials of these fields at the surfaces of the earth and sun 
ure equal to 6.3 X 107 and 1.9 X 10! m?/s2. 

6.2.7 The modern (relativistic) theory of gravitation, which is 
n unified theory of space, time and gravitation, was formulated 
by Albert Einstein. He called it the general theory of relativity. 
The existence of a close relationship between space and time 
had already been indicated in his special theory of relativity. 
This relationship is evident in the Lorentz transformations 
(0.3.2) and the invariance of the interval between two events 
(0.4.8). It was found necessary, in describing physical processes, 
lo employ a four-dimepSional space-time continuum, in which 
the position of a point is specified by three space coordinates 
and the time coordinate ct. 

According to the relativistic theory of gravitation, the geo- 
metrical properties (metric) of the space-time continuum 
depend upon the distribution of gravitating masses and their 
motion in space. Bodies setting up a gravitational field “distort” 
real three-dimensional space and differently change the passage 
of time at various points in this space. This means that the 
bodies cause a deviation of the metric of space-time from that 
of the “flat” space-time described by Euclidean geometry and 
dealt with in the special theory of relativity. Therefore, the 
motion of a body in a gravitational field can be regarded as 
inertial motion, but in a “distorted” (non-Euclidean) space- 
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time continuuin. Accordingly, a particle, subject to the action 
of a gravitational field, travels neither in a straight line nor 
at uniform velocity in a real three-dimensional space. 

It was shown in the relativistic theory of gravitation that the 
principle of superposition (6.2.3) does not hold for arbitrary 
gravitational fields. This principle, as well as the whole non- 
relativistic theory of gravitation, is sufficiently accurate only 
in weak fields (| ọ | < c?) and for motion in these fields at low 
velocities v & c. 


6.3 Kepler’s Laws. Space Velocities 


6.3.1 The motion of the planets of the solar system along 
their orbits around the sun complies with the three laws of 
Kepler. These laws can be derived from Newton’s law of uni- 
versal gravitation, dealing, as a first approximation, with 
the sun and planets as particles. In the central force field of 
gravitation of the sun, a planet of mass m is subject to the 
action of the gravitational force 


mMs 
r3 


F = —G r, 


where Mg is the mass of the sun and r is the radius vector of 
the planet from the centre of forces O, which is taken as the 
origin of coordinates. 

The moment of force F about the centre of forces is M = 
= [rF] = 0. Hence, the angular momentum L of the planet 
about the same point O does not change with time (4.3.1): 
L = [rmv] = const. 


Consequently, the planet travels along a plane path. (orbit), 
whose plane is perpendicular to vector L. According to Sect. 1.3.6, 
L = [rmvọl, where vg is the transverse velocity of the planet. 
Therefore, the orbital motion of the planet satisfies the con- 
dition 

dp L 


2 ee 
r E7 las const, 


where r and @ are the polar coordinates of the planet. 
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A second condition is imposed by the law of conservation 
of mechanical energy: Ek + Ep = E = const. According to 
1.3.6 and 6.2.2 


so that the second condition is of the form 


( dr i ( L j= 2GMs_ 2E 


dt mr r m ` 


6.3.2 The equation of the path of the planet (in the polar coor- 
dinates r and @) is 


ee ER 
1+ecos@ ’ 


where p = L2/Gm*Mg and e= V (2EL7/G?m®M2) + 1. The 
(otal mechanical energy of the planct E < 0, so that e<1 
and the path (orbit) of the planet is an ellipse. 

Kepler’s first law states that all the planets of the solar system 
travel along elliptic orbits of which the sun occupies one focus. 
lų follows from the first condition (6.3.1) that the areal ve- 
locity (4.3.6) of the planet is constant: 


dp L _ 
ae saa Cony 


Nepler's second law states that the radius vector of a planet 
sweeps out equal areas in equal lengths of time. 

6.3.3 According to Kepler’s second law, the orbital period, or 
period of revolution, T of a planet about the sun is equal to 


N -0186 
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the ratio of the area A of the orbit to the areal velocity o: 


p4 mab 


o o ? 


where a = p/(1 — e”) and b = a V1 —¢ are the major and 
minor semiaxes of the elliptic orbit. Thus 


T2 — _ wp a3 _ 4 
L?/4m? GMs 
This equation is an expression of Kepler's third law: the squares 
of the periods of revolution of the planets about the sun are 
proportional to the cubes of the major semiaxes of the elliptic 
orbits of these planets. 
6.3.4 The orbital velocity is the minimum velocity that it is 
necessary to impart to a body to convert it into an artificial 
satellite of the earth. It is also called the circular orbital velocity 
because it is equal to the velocity of an artificial satellite in 
circular orbit about the earth in the absence of air friction 
of the atmosphere. The orbital velocity is 


Vy = p= J 


where Mp is the mass of the earth, and r is the radius of the 
circular orbit. At the earth’s surface v, = 7.9 km/s. 

6.3.5 The escape velocity is the minimum velocity that it is 
necessary to impart to a body to enable it, without any addi- 
tional forces, to overcome the earth’s gravitational attraction 
and become an artificial satellite of the sun. It is also called 
the parabolic velocity because it launches the body on a para- 
bolic orbit in the ecarth’s gravitational field (in the absence 
of air friction of the atmosphere). The escape velocity 


2GM a 
n= poa 


where r is the radius from the launching site of the body to 
the centre of the earth. At the earth’s surface v, = 11.2 km/s. 
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6.3.6. The solar escape velocity is the minimum velocity that 
it is necessary to impart to a spacecraft, launched at the earth’s 
surface, to overcome the sun’s gravitational attraction and 
leave the solar system. This velocity. is və = 16.7 km/s. 


CHAPTER 7 MOTION IN NONINERTIAL FRAMES 
OF REFERENCE 


7.1 Kinematics of Relative Motion 


7.1.4 It is assumed in classical (Newtonian) mechanics that 
distances and time intervals remain unchanged in going over 
from one frame of reference 

lo any other frame that is in Yho 
motion in even the most 
arbitrary manner with re- 

spect to the first frame. For 
example, assume that K is 

an inertial frame with its r“ 
origin at point O* an is 

a noninertial frame with its 
origin at point O (Fig. 7.1). 
In the general case, the mo- Z 
lion of frame S with respect 

to frame K can be regarded a5. “ee 

as the sum of two motions: Fig. 7-4 

translational motion of point 

O at the velocity vg and rotation about this point at the 
angular velocity Q. Radius vectors r* and r of arbitrary 


particle M, measured in frames K and S, are related by 
Lhe equation 


OF 0 


OK LW pk J 
e  Ts 


where rë is the radius vector of point O measured in frame of 
reference K. 

7.1.2 The motion of particle M with respect to any inertial 
Irame of reference K, conditionally assumed to be fixed, is 
called the absolute motion of particle M. The motion of the 


Kak 
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same particle with respect to the noninertial frame of refer- 
ence S is said to be relative motion. 

The relative velocity v, of particle M, i.e. its velocity with 
respect to frame i g 


dz i 
Vr = TEK dt T 4S = k, 


where x, y and z are the Cartesian coordinates of particle M, 
and i, j and k are the unit vectors of the coordinate axes in 
frame S. 
The absolute velocity of particle M, i.e. its velocity v in the 
frame of reference K, cquals 

dr* di dj dk 
Meg OE agg Pe gg ee gg ee 


where vo = drg/dt is the absolute velocity of point O. Since 
the unit vectors of moving frame S can change in frame of 
reference K only due to the rotation of frame S about point O 
at the angular velocity Q, 

di ~ dj , dk 

— == [Qi], Pr and Ti == [Qk] 


v = W + Vp 


where vy = Vo + [Qr] is the velocity of transportation of par- 
ticle M. It is also called the bulk, or drift, velocity and is equal 
to the absolute velocity of the point of moving frame of ref- 
erence S (i.e. rigidly fixed in this frame) in which particle M 
is located at the given instant of time. 

7.1.3 The relative acceleration a, of particle M (its acceleration 
with respect to frame S) equals 


dea dy ES! Soo 
ar Sga ge I ae 


The absolute acceleration of particle M, i.e. its acceleration 
with respect to frame K, equals 


=at+ac+ ar. 
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am| Sr Hear 


is the acceleration of transportation of particle M. It is equal 
to the absolute acceleration of the point in moving frame S 
in which particle M is located at the given instant of time, and 


ac = 2 [Qvr] 


is the Coriolis acceleration of particle M. The Coriolis accelera- 
lion has its maximum value if the relative velocity v, of the 
particle is perpendicular to the vector Q of angular velocity 
of the moving frame of reference. It is equal to zero if the angle 
between vectors v, and Q equals zero or x, or if even one of 
these vectors equals Zero. 


7.2 Inertial Forces 


7.2.1 Newton’s laws of motiof ard not valid in noninertial 
lrames of reference. In particular, a particle can change the 
state of its motion with respect to noninertial frame S without 
any action being exerted on the particle by other bodies. A ball, 
lor instance, suspended by a string from the ceiling of a railway 
coach of a train travelling at uniform velocity in a straight 
line, deviates backward when the motion of the train is accel- 
crated and frontward when the motion is decelerated, i.e. it 
begins to move with respect to a noninertial frame of reference 
ntlached to the coach. But no horizontal forces act on the ball 
when it deviates. 

7.2.2 The fundamental law of dynamics for a particle in noniner- 
lial frames of reference can be derived from Newton's second 
law and the relation between the absolute and relative accel- 
erations of a particle. It follows from Sect. 7.1.3 that the product 
of the mass m of a particle by its relative acceleration equals 


ma, = ma — may — mac. 
According to Newton’s second law, formulated to apply to the 


absolute motion of a particle, i.e. its motion with respect to 
inertial frame K, 


ma — F, 
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where F is the vector sum of all forces acting on the particle. 
Ience, the fundamental equation of dynamics of a particle's 
relative motion is of the form: 


ma, = F — ma, — mac. 


This equation can be reduced to a form similar to the funda- 
mental law of dynamics for the absolute motion of a particle: 


ma, = F + h + Ie. 


The vector quantities I, = --may and Iç = —mac have the 
dimensionality of force and are called the inertial force of trans- 
portation and the Coriolis inertial force, respectively. 

7.2.3 It follows from Sect. 7.1.3 that in the general case the 
inertial force of transportation is the sum of three terms: 


dt dt 
The last term on the right-hand side of this equation 
I.t = —m [Q [Qr]] 


is called the centrifugal force of inertia, or simply the centrifugal 
force, because this vector is perpendicular to the instantaneous 
axis of rotation (to vector Q) of the noninertial frame S and 
its direction is away from this axis. Numerically the centri- 
fugal force equals 


keem e Ee r |—m[2 [2r]. 


Top = mQ?0, 


where p is the distance from a particle of mass m to the instan- 
taneous axis of rotation of the frame of reference. 

The inertial force of transportation coincides with the centri- 
fugal force when the noninertial frame of reference travels 
with translational motion at constant velocity (vọ = const) 
and rotates at constant angular velocity (Q = const). 

7.2.4 The Coriolis inertial force is 


Ic = 2m [v Q]. 


This force acts on the particle only when the noninertial frame 
of reference rotates and the particle moves with respect to 
this frame. Thus, for example, particles of water in rivers 
of the Northern Hemisphere that flow meridionally are subject 
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to the action of a Coriolis inertial force. This force is perpen- 
dicular to the velocity of flow of the river and leads to under- 
mining the right bank in the direction of flow. 

The Coriolis inertial force does no work in the relative motion 
of a particle because the force is perpendicular to the velocity 
of relative motion of the particle. Hence, the Coriolis inertial 
force is an example of the gyroscopic forces (3.1.7). 

7.2.5 The action of inertial forces on a particle in a noninertial 
frame of reference are real and can be measured in this frame 
by, for example, a spring dynamometer. But, in contrast 
to ordinary forces of interaction between bodies, it is impossible 
lo indicate for inertial forces the action of what specific bodies 
on the particle being considered they express. This feature of 
inertial forces is due to the fact that the vefy occurrence of the 
vector quantities I, and Iç in the fundamental equation of the 
dynamics of relative motion results only from the noninertial 
nature of the frame of reference used to describe the relative 
motion of the particle. The addition of the inertial forces I, 
and Ig to the force F, which characterizes the action on the 
particle of all the other bodies, enables the fundamental ‘equa- 
tion of the dynamics of relative motion to be written in a form 
that resembles the expression for Newton’s second law in an 
inertial frame of reference. l 
There can be no closed systems of bodies in noninertial frames 
of reference because the inertial forces are always external 
lorces for any of.the bodies in the system. Therefore, the laws 
of conservation of momentum (2.7.1), of angular momen- 
lum (4.4.1) and of energy (3.4.2) are not valid in noninertial 
lrames of reference. 


7.3 Relative Motion in a Frame of Reference Fixed 
fo the Earth. Gravity Force and Weight of a Body 


7.3.1 A frame of reference fixed in the earth is noninertial 
lor two reasons: firstly, due to the diurnal, or daily, rotation 
of the earth at constant angular velocity Q (where Q = 27 
rad per 24 h = 7.3 X 10-5 rad/s) and, in the second place, 
duc to the effects of the gravitational field of the sun, moon, 
planets and other astronomical bodies on the earth. This gravi- 
tational field is practically uniform within the limits of the 
earth and imparts the same acceleration of translational motion 
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to the terrestrial frame of reference and all bodies moving with 
respect to this frame. This acceleration is ay = dv,/dt= gf, 
where gr is the intensity of the field (6.2.4). 

It follows from the equations in Sects. 7.2.2 and 7.2.3 that 
the equation of relative motion of a particle of mass m in a 
frame of reference fixed to 
the earth is of the form 


ma, = F + Fe, + Lett Ic, 


where I, and Ic are the cen- 
trifugal and Coriolis inertial 
forces, Fg, is the gravitation- 
al attraction of the earth 
for the particle (6.1.3), and 
F is the sum of all the 
other forces acting on the 
particle, with the exception 
of the gravitational forces. 
7.3.2 The gravity force of a 
Fig. 7.2 body is the force P, applied 

to the body and equal to 

the vector sum of the force 
Fy, of gravitational attraction of the earth for the body (6.1.3) 
and the centrifugal inertial force I,¢ due to the diurnal 
rotation of the earth (Fig. 7.2): 


= Fer + kst, 


i.e. 


mM eg 


r3 


P= —G r— m [Q [Qr]j-: 


Here m and My are the masses of the body and the earth, 
r is the radius vector drawn from the centre of the earth to the 
place where the body is located, Q is the angular velocity of 
diurnal rotation of the earth, and G is the gravitational con- 
stant (6.1.1). 

The gravity force causes an unsupported or unfastened body 
to fall to the earth. It equals the force with which the body 
presses against a horizontal support (or pulls on a vertical 
suspension) due to the gravitation attraction of the earth. 
It can be measured in the terrestrial frame of reference by 
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means of a spring dynamometer. The point of application 
of the gravity force acting on a body, i.e. the point of appli- 
cation of the resultant of all the gravity forces acting on all 
the particles of the body is called the centre of gravity of the 
body. It coincides with its centre of mass (2.3.3). 

7.3.3 The gravity force of a body is independent of its velocity 
of relative motion. It is proportional to the mass m of the body 
and can be presented in the form 


P = mg, 


where g is the acceleration due to gravity, or free fall acceleration 
(see 7.3.5). At a given place on earth, vectof g is the same for 
all bodies and changes for other places: 

The gravity force of a body coincides with the gravitational 
attraction of the earth for the body only at the earth’s North 
and South poles because there the centrifugal inertial force 
Ief = 0. The greatest difference between the gravity force and 
the gravitational attraction is observed at the equator, where 
the force Ief reaches its maximum value and is opposite in 
direction to Fg». Even at the equator, however, the gravity 
force and the force of gravitational attraction differ by only 
0.35 per cent. At all points of the carth’s surface, except the 
equator and the poles, forces P and Fe; fail to coincide in direc- 
tion as well (Fig. 7.2), but the maximum angle between them 
does not exceed 6 minutes of arc. The gravity force decreases 
with the altitude. Near the earth’s surface this decrease is 
approximately 0.034 per cent per kilometre above the surface. 
7.3.4 The acceleration g near the earth’s surface varies from 
9.78 m/s? at the equator to 9.83 m/s? at the North and South 
poles. This is due, in the first place, to the dependence of the 
centrifugal inertial force on the geographical latitude of the 
location and, in the second, to the nonspherical shape of the 
earth, which is slightly flattened at the poles of its axis of 
rotation, having the shape of an ellipsoid of rotation, or sphe- 
roid (the polar and equatorial radii of the earth are Ry, = 
= 6357 km and R, = 6378 km). The standard value of free 
fall acceleration, accepted in establishing a system of units 
and for barometric computation, is 9.80665 m/s?. 

7.3.5 The free fall of a body is its motion due to the action of 
the gravitational field alone. 

The free fall acceleration of a body falling to the earth, mea- 
sured in a noninertial frame of reference rotating together 
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with the earth, can be found by the motion equation (7.3.1) 
by putting F = 0, Fg, +- Ief = mg and Iç = 2m [v,Q]. Then 


ap = g +4- 2 [v,Q]. 
If v, = 0, a, = g. 


Consequently, vector g is equal to the free fall acceleration 
of the body, measured with respect to the terrestrial frame of 
reference at the instant when the relative velocity of the body 
equals zero. For this reason, vector g is called the free fall 
acceleration. 

If the relative velocity of a freely falling body v, = 0, its 
acceleration with respect to the earth is not g: g = a, + 2 [Qv,]. 
At velocities v, < 680 m/s, however, the values of g and a, 
differ by less than one per cent. It can therefore be assumed 
in many cases that with respect to an observer on the earth 
free fall of a body is caused only by the gravity force acting 
on and imparting the acceleration g to the body. In a similar 
manner the action of the Coriolis inertial force on a freely falling 
body can be treated as a comparatively small perturbation. 
Thus, for instance, a freely falling body is deviated by the 
Coriolis force to the east of the direction assumed by a plumb 
line, i.e. from the direction of the vector P = mg. This devia- 
tion for a body falling freely without any initial velocity from 
the height k is equal, at the latitude @, to 


sa Qh y Z COS (—p. 


If, for example, k = 160 m and @ = 45°, s = 1.55 cm. 
7.3.6 The weight of a body is the’ force Q with which it acts, 
due to the earth’s gravity, on a support or suspension that 
prevents it from falling freely. It is assumed that the body 
and its support (or suspension) are fixed with respect to the 
frame of reference in which the weight is determined. The 
force —Q is exerted by the support or suspension on the body. 
It follows from the fundamental equation of dynamics of rela- 
ive motion (7.2.2), where a, = a¢ = 0 and F = Fg, —Q, 
that 


Q= Fe, + i, 
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where Fey is the force of gravitational attraction of the earth 
for the body, and I, is the inertial force of transportation due 
to the noninertial nature of the frame of reference. 
Example 1. The weight of a body in a frame of reference fixed 
to the earth is equal to the gravity force of the body (7.3.2): 


Q = Fy, + Ket = P. A 
xample 2. The weight of a body ina frame of reference at- 


tached to a lift (elevator) which travels with translational motion 
with respect to the earth with the acceleration agọ: 


Q = P — ma). 


If the lift falls freely, a) = g, the free fall acceleration, and 
the weight of the body in the lift is Q = 0, i.e. the body is 
in the state of weightlessness. 

7.3.7 Weightlessness, or zero gravity, is the state of a mechan- 
ical system in which the external gravitational field acting 
on the system does not cause any pressure of the parts against 
one another or their deformation. Such a state occurs in a me- 
chanical system complying with the following three conditions: 
(a) no other external forces, except those of the gravitational 
field, act on the system; (b) the size of the system is such that 
within its limits the external gravitational field can be regarded 
as being uniform; and (c) the system is in translational motion. 
The state of weightlessness is characteristic, for instance, of 
bodies in a spaceship because, during the main part of its 
path in a gravitational field, it is in free flight, i.e. coasting 
with the engines turned off. 


7.4 Principle of Equivalence 


7.4.1 The inertial forces acting on bodies in a noninertial 
frame of reference are proportional to the masses of the bodies 
and, other conditions being equal, impart the same relative 
accelerations (7.1.3) to them. In other words, all bodies free 
of external action travel in an “inertial force field”, i.e. with 
respect to a noninertial frame of reference, in absolutely the 
same way, provided that the initial conditions of their motion 
are also the same. A similar behaviour is observed for bodies 
travelling with respect to an inertial frame of reference and 
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subject to the action of the forces of a gravitational field. At 
each point in the field these forces, like the inertial forces, are 
proportional to the masses of the bodies and impart the same 
free fall acceleration to them. This acceleration is equal to 
the field intensity (6.2.1) at the point being considered. 
In a noninertial frame of reference, for example, fixed to a lift 
(elevator) travelling with uniform acceleration vertically 
upward at the acceleration of transportation a, = const, all 
free bodies fall in the absence of a gravitational field with the 
same relative acceleration a, = —aọ. Free bodies behave in 
exactly the same way in the same lift when it travels at uni- 
form velocity in a uniform gravitational field of the intensity 
g = —a,. Thus, on the basis of experiments conducted on the 
free fall of bodies within a tightly closed lift, it is impossible 
to establish whether the lift is travelling at uniform velocity 
in a gravitational field of the intensity gf = a, (in partic- 
ular, the lift may also be at rest in this field) or whether it 
A travelling with constant acceleration of transportation 
= —a, in the absence of a gravitational field. 
7 4.2 The local equivalence principle states that the gravita- 
tional field in a limited region of space is equivalent to an 
“inertial force field” in a properly chosen noninertial frame 
of reference. The region of space should be small enough so that 
the gravitational field in it can be assumed to be uniform. 
The equivalence principle is not to be understood as a state- 
ment of the identity of inertial forces and Newton's gravita- 
tional forces that act between bodies. As a matter of fact, the 
intensity of a true gravitational field, set up by bodies, decreases 
as the bodies recede farther and farther away, and approaches 
zero at infinity. Gravitational fields “equivalent” to iner- 
tial forces do not comply with this condition. For example, 
the intensity of a gravitational field, “equivalent” to the centri- 
fugal inertial forces in a rotating frame of reference increases 
without limit at points farther and farther away from the axis 
of rotation. The intensity of a field “equivalent” to the inertial 
forces of transportation in a frame of reference having transla- 
tional motion is the same at al! points of the field. 
7.4.3 A true gravitational field, in contrast to one “equivalent” 
to inertial forces, exists in both noninertial and inertial frames 
of reference. No ‘possible choice of a noninertial frame of ref- 
erence can completely climinate a true gravitational field, 
i.e. compensate for it throughout space by means of a “field 
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of inertial forces”. This follows in any case, even if only from 
the different behaviour of “fields of inertial forces” and true 
oravitational fields at infinity. A true gravitational field can 
be eliminated only locally, i.e. for a small region of space 
within whose limits the field can be assumed uniform, and for 
a short time interval during which the field can be assumed 
constant. The noninertial frame of reference corresponding to 
this operation should travel with an acceleration of transpor- 
tation equal to the free fall acceleration of bodies in the region 
of the true gravitational field being considered. Thus, in a space 
ship in free flight in a gravitational field, the gravitational 
forces of attraction are compensated for by the inertial forces 
of transportation and do not lead to the relative motion of 
bodies in the ship. ° 


PART TWO FUNDAMENTALS 
OF MOLECULAR PHYSICS 
AND THERMODYNAMICS 


CHAPTER 8 IDEAL GASES 


8.1 Subject Matter of Molecular Physics. 
Thermal Motion 


8.4.4 The branch of physics called molecular physics deals with 
the dependence of the structure and physical properties of 
bodies on the nature of the motion of and the interaction be- 
tween the particles making up these bodies. 

Molecular physics is based on the molecular-kinetic theory 
of the structure of matter. According to this theory all bodies 
consist of submicroscopic particles—atoms, molecules or ions— 
in continuous chaotic motion, which is called thermal motion. 
Experimental confirmations of the molecular-kinetic theory 
are: the Brownian movement (11.7.1), the transport phenomenon 
in various aggregate states of matter and other phenomena. 
8.1.2 The molecular-kinetic theory of the structure of matter 
is used to advantage in various branches of physics. A great 
variety of physical phenomena that proceed due to the interac- 
tion and motion of particles of matter are dealt with in this 
theory from a unified point of view. This theory, for example, 
enables the mechanism of elasticity in solids (40.3.1) to be 
understood, reveals the cause of internal friction in gases (10.8.4) 
and liquids, explains the difference between real gases (42.1.2) 
and ideal gases (8.4.1), and elucidates the mechanism of elec- 
trical conduction in conductors of electric current of various 
kinds, and the electrical and magnetic properties of matter. 
8.1.3 The thermal motion of particles of matter in various 
states of aggregation is not the same. It depends upon the 
forces of attraction or repulsion between the atoms, molecules 
and ions. 

There are practically no attractive forces between the atoms 
and molecules in sufficiently dilute gases. This is due to the 
fact that the particles of such gases are located at distances 
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from one another exceeding the range of molecular action 
(12.2.3). The particles of such gases travel at uniform velocity 
and in a straight line until they collide with one another or 
with the walls of the container. These collisions are of a random 
nature. Each molecule in the volume of the gas may collide 
with any of the nearby particles, changing the direction of its 
motion in any arbitrary manner. The collision of a molecule 


(or atom) of the gas with a wall may occur at any angle 0 ( where 


T 1 
i 
of gas is a purely random one and, on the average, the same 
number of molecules are travelling at any instant of time in 
any arbitrary direction within the gas. 
8.1.4 Solid crystalline bodies are characterized by considerable 
forces of interaction between their particles (atoms, molecules 
and ions). The joint action of the forces of attraction and repul- 
sion between the particles (12.1.3) leads to the vibration of 
the particles of solids about their equilibrium positions, which 
are called the sites, or points, of the crystal lattice (40.1.4). 
Intermolecular interaction and violations of periodicity in 
crystals (40.1.1) make these vibrations anharmonic (28.1.3). 
8.1.5 The thermal motion of molecules of a liquid is of a kind 
intermediate between the two preceding types of motion (8.1.3 
and 8.1.4). For a definite length of time the molecules of a liquid 
vibrate about a.certain equilibrium position and are in a tem- 
porarily settled state. At the end of this time the equilibrium 
position of the molecules is displaced and a new settled arrange- 
ment is formed. Simultaneously, a slow displacement of the 
molecules and their vibrations occur within small volumes 
of the liquid (see also Sect. 13.2.3). 


). Asa result, the thermal motion of molecules 


8.2 Statistical and Thermodynamic Methods 
of Investigation 


8.2.1 There is a huge number of atoms (or molecules) in any 
body. For instance, in 1 cm? of a gas whose properties are close 
to those of an ideal gas there are 2.7 X 10'® molecules at stand- 
ard conditions. In the condensed states, liquid and solid, 
the amount is of the order of 102% particles/em. If we assume 
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that each atom (or molecule) obeys Newton's second law (2.4.1) 
it is obviously quite impossible, not only to solve the dif- 
ferential equations of motion of the separate particles, but 
even to write these equations. Therefore, the behaviour of 
a separate molecule (or atom) of a body, for example, its path 
and the sequence of changes in its state, cannot be investigated 
by the methods used in classical mechanics. 

8.2.2 The macroscopic properties of systems consisting of a very 
great number of particles are studied by the statistical method. 
It is based on the application of probability theory to definite 
models of the structure of the systems being investigated. 
The branch of theoretical physics in which the physical prop- 
erties of systems are studied by the statistical method is 
called statistical physics (statistical mechanics). Special laws, 
known as statistical laws are manifested in the joint behaviour 
of large numbers of particles. Certain average values of the 
physical quantities can be found for a system consisting of 
a great many particles. These values characterize the set of 
particles as a whole. In gases, for instance, there are average 
values of the velocity of thermal motion of the molecules (10.3.6) 
and of their energy (10.2.4). In a solid there is an average energy 
per degree of freedom of the vibrational motion of its particles 
(41.7.2), etc. All properties of a system of particles are due, 
not only to the individual properties of the particles themselves, 
but to the features of their joint motion and average values of 
their dynamic characteristics (average velocities, average 
energies, etc.). 

In addition to statistical laws, there are also dynamical laws, 
describing the motion of separate particles. The relationship 
between dynamical and statistical laws is manifested in the 
fact that the laws of motion of individual particles have an 
effect of the description of a system of particles, which is being 
investigated by the statistical method. 

8.2.3 Another procedure for studying physical phenomena 
is the thermodynamic method, which is not concerned with 
either the internal structure of the substances making up the 
bodies (or systems) being investigated or with the nature of 
the motion of the separate particles. The thermodynamic 
method is based on the study of the various conversions of 
energy that occur in a system. The conditions of these conver- 
sions and the relations between the various kinds of energy 
enable the physical properties of systems to be studied in 
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a great variety of processes in which the systems participate. 
The branch of physics in which research in the physical prop- 
erties of systems is conducted by means of the thermodynamic 
method is called thermodynamics (phenomenological thermody- 
namics). Thermodynamics is based on two experimentally 
established principles, or laws, of thermodynamics (9.3.4 and 
11.3.2), as well as the Nernst heat theorem, or third law of 
thermodynamics (11.8.2). 


8.3 Thermodynamic Variables. Equations of State. 
Thermodynamic Processes 


8.3.1 Thermodynamics deals with thermodynamic systems. There 
are macroscopic items (bodies and fields) that can interchange 
energy with one another as well as with the external medium, 
i.e. bodies and fields that are external with respect to the 
given system. 

8.3.2 The physical quantities introduced to describe the state 
of a thermodynamic system are called thermodynamic variables, 
or parameters (state variables), of the system. Usually the pres- 
sure, specific volume and temperature are taken as the thermo- 
dynamic variables. 

The pressure p is a physical quantity equal to the force acting 
on unit area of a body’s surface in the direction normal to 
the surface: 


_ dF y 
~ dA? 


where dF, is the numerical value of the normal force acting 
on an infinitely small part of the surface of the body, the area 
of this part being dA. 

The specific volume v is the reciprocal of the density p of a body: 
v = 1/o. The specific volume of a homogeneous body is equal 
to the volume of unit mass of the body. 

8.3.3 The concept of temperature is of significance for equi- 
librium states of a thermodynamic system (8.3.4). An equilib- 
rium state (state of thermodynamic equilibrium) is a state of 
a system which does not change with time (steady state), the 
steady nature of the state not being due to processes occurring 
in the external medium. An equilibrium state is set up in 


J—0186 


130) IDEAL GASES f$ 


a system subject to constant external conditions and is main- 
tained in the system for an arbitrary length of time. The tem- 
perature is the same in all parts of a thermodynamic system 
that is in a state of thermodynamic equilibrium. If two bodies 
with different temperatures come into contact, internal ene- 
by (9.1.2) is transferred from the body at the higher temperature 
to that at the lower temperatuie. The process ceases when the 
temperatures of the two bodies become equal. 

8.3.4 The temperature of an equilibrium system is a measure 
of the intensity of thermal motion of its molecules (or atoms 
or ions). In an equilibrium system of particles that obeys the 
laws of classical statistical mechanics (8.2.2), the average 
kinetic energy of the thermal motion of the particles is pro- 
portional to the absolute temperature (10.2.5). 
Temperature can only be measured indirectly, making use 
of the fact that many physical properties of bodies, lending 
themselves to direct measurement, depend on the temperature. 
A change in the temperature of a body leads to changes in its 
length and volume, density, electric resistance, elastic prop- 
erties, etc. A change in any of these properties can be used 
to measure temperature. For this purpose, it is necessary for 
a certain body, said to be temperature-indicating, that the 
functional dependence of the given property on the temperature 
be known. Temperature scales based on temperature-indicating 
bodies are said to be empirical. 

In the international centigrade (Celsius) scale, temperature is 
measured in °C and is denoted by ¢ (degree centigrade or degree 
Celsius). It is assumed that at a standard pressure of 1.01325 X 
x 10° Pa (Appendix I) the melting point of ice and the boiling 
point of water are equal to 0°C and 100°C. 

In the thermodynamic (absolute) temperature scale, the tem- 
perature is measured in kelvins (K) and is denoted by T. 

The relation between the absolute temperature and the tem- 
perature measured by the Celsius scale is: T = 273.15 + 1. 

The temperature T = 0 (or t = —273.15°C) is called absolute 
zero temperature (the impossibility of reaching absolute zero 
is discussed in Sect. 11.8.4). 

8.3.5 The state variables of a system are classified as being 
either external or internal. External state variables are physical 
quantities depending upon the position in space and various 
properties of bodies that are external with respect to the given 
system. For a gas, for instance, the volume V of the vessel in 
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which the gas is contained is an external variable because the 
volume depends upon the position of external bodies: the 
walls of the vessel. For a dielectric in an electric field, the 
intensity of the field, set up by certain external sources, is an 
external variable. If a liquid is held in an open vessel, the 
atmospheric pressure, for example, is an external variable. 
The internal variables of a system are physical quantities that 
depend both on the position of bodies external with respect 
to the system, and on the coordinates and velocitites of the 
particles that make up the system. Internal variables of a gas, 
for instance, are its pressure and energy because they depend 
upon the coordinates and velocities of the moving molecules 
and on the density of the gas. 

8.3.6 The state variables of a system that is in equilibrium 
(8.3.3) are not independent. The internal variables of such 
a system depend only on its external variables and on the 
temperature. The equilibrium state of a simple. system* of 
given chemical composition and mass M is determined by 
specifying two variables: the volume V and the temperature T. 
The equation of state (thermal equation of state) of a simple 
system is the functional dependence of the equilibrium pressure 
p in the system on the volume and temperature 


p = Í (V, T). 


The equation of state is obtained experimentally in thermo- 
dynamics. In statistical physics (8.2.2), it is derived theoret- 
ically. This constitutes the interrelation between the statis- 
tical (8.2.2) and thermodynamical (8.2.3) methods of investi- 
gation. 

8.3.7 If any of the external variables of the system is changed, 
the state of a thermodynamic system is changed. This is called 
a thermodynamic process. A thermodynamic process is said 
lo be equilibrium (an equilibrium, quasi-static process) if the 
system passes at an infinitely slow rate through a continuous 
succession of infinitely close thermodynamic equilibrium 
slates (8.3.3). All processes that do not comply with these 


* An example of a simple system is a gas in the absence of 
cxternal fields. Mixtures of chemically homogeneous gases of 
constant composition, chemically pure liquids, etc. are also 
simple systems. 


9* 
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conditions are said to be nonequilibrium ones. Real processes 
are nonequilibrium ones because they proceed at a finite rate. 
But the slower they proceed, the closer they are to equilibrium 
processes. 

Isoprocesses are thermodynamic processes that proceed in 
a system of constant mass with a constant value of one of the 
state variables. 

An isothermal process proceeds at constant temperature (T = 
= const). 

An isochoric (isovolumic) process proceeds at constant volume 
(V = const). 

An isobaric process proceeds at constant pressure (p = const). 
An adiabatic process is a thermodynamic process that occurs 
in a system without heat exchange (9.2.4) with external bodies 
(see also Sect. 9.5.10). 

8.3.8 State functions are physical quantities characterizing the 
stafe of a system. Variation in the state functions during ther- 
modynamic processes is independent of the kind of process 
that occurs. State functions are uniquely determined by the 
values of the variables of the initial and final states of the 
system. The simplest state functions of a system are its inter- 
nal energy U (9.1.2) and its entropy S (11.4.2). 


8.4 Equation of State of an Ideal Gas 


8.4.1 An ideal, or perfect, gas is one whose molecules do not 
interact with one another at a distance and are of vanishingly 
small size. In real gases (12.1.2) the molecules are subject to 
the forces of intermolecular interaction (42.1.3). 

In collisions with one another and in striking the walls of 
the vessel the molecules of an ideal gas behave as perfectly 
elastic spheres (1.1.4) of diameter d (effective diameter of the 
molecule), which depends upon the chemical nature of the gas. 
The existence of an effective diameter d (where d ~ 10-} m) 
signifies that forces of mutual repulsion (12.1.3) act between 
the molecules. Intermolecular forces of attraction (42.1.3) 
decrease drastically with an increase in the distance r between 
molecules and are practically negligible at Rm > 10-9 m. 
In real gases at low densities the average distance (r) between 
molecules exceeds Rm and, to a good approximation, they can 
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be regarded as ideal. Hydrogen, helium, oxygen and nitrogen 
are assumed to be ideal gases at densities corresponding to 
standard conditions. 

8.4.2 For a given mass of an ideal gas, the ratio of the product 
of the numerical values of pressure and volume to the absolute 
temperature is a constant value (Clapeyron equation): 


pV 
T cons 


The numerical value of the gas constant C depends upon the 
choice of the units of measurement of p, V and T. 

If v is the specific volume of the gas (8.3.2) and M is its mass, 
then V = Mv and the Clapeyron equation assumes the form 


C 
pv = -y 1 58T, 


where B = C/M is the specific gas constant referred to unit 
mass. 

8.4.3 From the definition of the mole (Appendix I), it is evident 
that moles of different gases contain the same number of mole- 
a This is called Avogadros number Ny, (Appen- 
dix II). 

The molar mass u of a gas or any body is the physical quantity 
cqual to the ratio of the mass of the gas (or any body) to the 
number of moles N it contains: u = M/N. 

The molar mass is proportional to the relative mass of the 
molecules of a gas: u = 10- m/m,, where m is the mass of 
the molecules of the given gas and mọ is the atomic mass unit 
based on the carbon scale (Appendix II). The coefficient 10- is 
required because molar mass is measured in kg/mol in the SI 
(International System of Units). 

The molar volume Vy is the physical quantity equal to the 
ratio of the volume V of a gas to the number of moles N it 
contains: me = V/N. The mass of a mole is numerically equal 


lo u and therefore V, = uv, where v is the specific volume 
(8.3.2). j 

8.4.4 The equation of state for a mole of ideal gas is 

PVu 


FT =B =R or PV =RT. 
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Here R is the universal gas constanti, being the gas constan- 
per mole of gas. The universal nature of R follows from Avot 
gadro’s law, which states that moles of all ideal gases occupy 
the same volume at the same pressure and the same tempera- 
ture. Under standard conditions (T = 273.15 K and p = 
= 1.0132 x 10° Pa = 1 atm = 760 mmHg) a mole of any 
gas has a volume of V, = 22.415 x 10-3 mê. From this we can 
calculate the numerical values of R for various systems of 
units (Appendix II). 

If a volume V of gas has the mass M kg, i.e. if it contains M/p 
mořes, then V = M/u X V, and the equation of state of the 
gas assumes the form called the Mendeleev-Clapeyron equation: 


M 
yv =M par. 
P =g 


8.4.5 Boltzmann's constant k is a physical quantity equal 
to the ratio of the universal gas constant R to Avogadro's 
number N 4, i.e. k = R/N a. The values of k in various systems 
of units are listed in Appendix II. 
The equation of state of an ideal gas, expressed by means of 
Boltzmann’s constant, is of the form 

O kNA 


p= Vu =hknol, 


where no = Na/V, is the number of gas molecules in unit 
volume (molecule concentration). 

At constant temperature, the pressure of a gas is proportional 
to the concentration of its molecules (or to the density of the 
gas). 


CHAPTER 9 FIRST LAW OF THERMODYNAMICS 


9.1 Total and Internal Energy of a System 


9.1.1 An arbitrary thermodynamic system (8.3.1), in any 
thermodynamic state (8.3.3), has a total energy E, which is 
the sum of: 


(a) the kinetic energy °C” f mechanical motion of the system 
as a whole (or of its macroscopic parts); 
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(b) the potential energy Et (3.3.1) of the system in all external 
force fields (for instance, electromagnetic or gravitational 
fields); and 

(c) the internal energy U: 


[iss gme |. ext U. 


k 


9.1.2 The internal energy of a body or of a thermodynamic 
system (8.3.1) is the energy depending only upon the thermody- 
namic state of the body (or system). For a fixed (stationary) 
system, not located in any external force fields, the internal 
cgergy coincides with the total energy. The internal energy 
also coincides with the rest energy of a body (or system) (5.7.3) 
and includes the energy of all the types of internal motion of 
a body (or system) and the energy of interaction of all the 
particles (atoms, molecules, ions, etc.) making up the body 
(or system). 

lor example, the internal energy of a gas consisting of poly- 
atomic molecules (ammonia, carbon dioxide, etc.) consists of: 
(a) kinetic energy of thermal translational and rotary motion 
of the molecules; 

(b) kinetic and potential energy of vibration of the atoms and 
molecules; 

(c) potential energy due to intermolecular interactions; 

(d) energy of the electron shells of the atoms and ions; 

(e) kinetic energy and potential energy of interaction of the 
nucleons (42.1.1) in the nuclei of the atoms. 

Terms (d) and (e) do not ordinarily change in processes that 
proceed at not very high temperatures so that ionization and 
excitation play no essential role. Under these conditions, 
terms (d) and (e) are not taken into account in the internal 
cnergy balance. Term (c) is also neglected for an ideal gas 
(8.4.1). 
9.1.3 The internal energy is a single-valued function of the 
thermodynamic state of a system. The value of the internal 
energy in any state is independent of the process by means of 
which the system reached this state. The change in internal 
energy in the transition of a system from state 1 to state 2 equals 
AU = U, — U and is independent of the kind of transition 
| — 2 that occurs. If the system accomplishes a cycle (11.1.1) 
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the total change in its internal energy is equal to zero: $ dU = 


As is well known, such a relation mathematically signifies 
that an element of change in the internal energy dU is a com- 
plete (exact) differential. This property is also possessed by the 
entropy (11.4.2) and other state functions (8.3.8) (cf. Sect. 9.2.5). 
9.1.4 In a system that is in a state of thermodynamic equilib- 
rium the internal energy depends only upon the temperature 
and the external variables (8.3.5). In particular, for a simple 
system (8.3.6) of constant mass M, the internal energy is a func- 
tion of the temperature T and volume V of the system (calorific 
equation of state of a simple system): 


U = ọọ (V, T). 


Example 1. The internal energy of an ideal gas (8.4.1) depends 
only on its absolute temperature and is proportional to the 
mass M of the gas: 


T T 
U= | CvaTr+0 =M (| ev aru] ’ 
0 () 


where Cy and cy = Cy/M are the heat capacity (9.5.1) and 
specific heat (9.5.2) of the gas at constant volume (in an isochoric 
process) (8.3.7), ug = U,/ M is the internal energy of unit 
mass of the gas at T = 0 K. For monatomic gases at ordinary 
temperatures Cy, is independent of T, and U = CyT + Uo. 
Example 2. The internal energy of a van der Waals gas (12.2.1) 
is: 

T 

| 2 


U=) CyaT——, 


a 

i y T Uo 
0 

where M is the mass of the gas, u is its molar mass (8.4.3), 

and a is the van der Waals’ coefficient (12.2.3). 

9.1.5 In thermodynamics, the internal energy is determined to 


an accuracy of the constant term U,, whose value depends 
upon the choice of the reference point for the quantity U, 
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i.c. from a state with zero internal energy. The quantity U, 
plays practically no role in thermodynamical calculations 
in which the changes AU of internal energy, independent of 
Uy, are determined (see also Sect. 8.1.2). 


9.2 Heat and Work 


9.2.1 An energy exchange between a thermodynamic system 
and external bodies takes place in two ways: either by doing 
work or by means of heat exchange.* The amount of energy 
transferred to the system by external bodies in force inter- 
action between the system and the bodies is called work done 
on the system. The-amount of energy transferred to the system 
by external bodies by heat transfer is called the quantity of 
heat imparted to the system.** 

9.2.2 If the thermodynamic system is fixed (stationary), a neces- 
sary condition to do work is displacement of the external bodies 
interacting with it, i.e. it is necessary to change the external 
state variables of the system (8.3.5). In the absence of external 
force fields, energy exchange between the fixed system and 
the external medium may take place by doing work when the 
volume and shape of the system are changed. According to the 
law of conservation of energy, the work W’ done on the system 
by external forces is numerically equal and opposite in sign 
to the work W that the system does on the external medium, 
i.c. against the external forces: 


W = —W. 


9.2.3 Expansion work is the work done by the system in over- 
coming the external pressure. An element of expansion work 
SW = pext dV, where pe; is the uniformly distributed exter- 
nal pressure, and dV is an element of change of the system’s 
volume. If the expansion process is an equilibrium (quasi- 


* A discussion of the third way of energy exchange—mass trans- 
ler—is beyond the scope of the present handbook. 

** Sometimes the term “heat” is employed in place of the term 
“quantity of heat” if this leads to no confusion. It should be 
indicated, however, that heat, generally speaking, is to be under- 
stood as a form of energy transfer (9.2.4). . 
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static) one (8.3.7), Pext = Pp, Where p is the pressure in the 
system. Then OW = p dV. 

The significance of the different ways used in denoting the 
elements of change ÔW and dU is explained in Sects. 9.4.3 
and 9.1.3. The work done in equilibrium expansion of system 
from volume V, to volume Vy, is 


Ve 
Vi 


Example. The work done by a gas in a vessel with a weightless 
movable piston (Fig. 9.1). The pressure of the gas p > O and, 
therefore, in its expansion (dV > 0) the gas performs positive 
work (ôW > 0). In compressing the gas (dV< 0 
and 6W < 0) the positive work is done on the 
Fort gas by the forces of external pressure. Here 
the gas does negative work. 
The graphical representation of work is dis- 
cussed in 9.4.2. 
9.2.4 Heat exchange takes place between 
bodies (or parts of bodies) heated to different 
temperatures. There are three types of heat 


exchange: convection, conduction and 
radiation (radiative heat exchange). 
Fig. 9.4 Convective heat exchange is the transfer of 


energy in the form of heat between the 

unequally heated parts of gases or liquids 
or between gases, liquids and solids. Convective heat exchange 
in liquids takes place by motion of parts of the liquid with 
respect to one another or with respect to solids. For instance, 
in a radiator of a hot-water heating system, energy of the hot 
water flowing through the radiator is transferred by convection 
to the less heated walls of the radiator. 
The phenomenon of heat conduction consists in the transfer of 
energy in the form of heat from one part of a nonuniformly 
heated body to another part. Thus, for example, energy is 
transferred through the walls of a heating radiator from the 
oe heated inner surfaces to the less heated outer sur- 
‘aces. 
Radiative heat exchange takes place without direct contact of 
the bodies exchanging energy and consists in the emission and 


9.2] HEAT AND WORK 139 


absorption of the energy of an electromagnetic field by the 
bodies. An immense amount of energy is delivered from the 
sun to the earth’s surface by radiative heat exchange. 

9.2.5 Work and heat are the energy characteristics of processes 
for changing the state of thermodynamic systems and are 
meaningful only when such processes occur. Depending upon 
the kind of process the system undergoes in its transition from 
state 1 to state 2, it is necessary to perform different amounts 
of work and to deliver different quantities of heat to the system. 
A comparison with Sect. 8.1.3 (on the change in the internal 
energy in processes) indicates that work and heat are not forms 
of energy and it is incorrect to speak of a “stockpile of work” 
or a “reserve of heat” in a body. For the same reasons, an elc- 
ment of the quantity of heat 6Q and an element of work ôW 
are not complete differentials (cf. Sect. 9.1.3). 

9.2.6 The performance of work on a system can change any 
type of its energy. For example, when a gas is rapidly com- 
pressed in a vessel by means of a moving piston (9.2.3), the 
work done by external forces on the gas increases the internal 
energy of the gas. In the inelastic collision of two bodies (3.5.3), 
a part of the work is expended to change the kinetic energy 
of the bodies (3.2.2) and a part to change their internal energy. 

If energy is delivered to a system in the form of heat, it is 
used only to increase the internal energy of the system. This 
is due to the fact that in any kind of heat exchange (9.2.4), 
energy is transferred directly between chaotically moving 
particles of the bodies. This changes their internal energy. 
In the process of heat conduction, for instance, in a nonuniform- 
ly heated rigid body the particles of the body located in its 
more heated parts transmit some of their energy to particles 
located in its less heated parts. The result is an equalization 
of the internal energies of the various parts of the body, an 
equalization of their temperatures and the discontinuation 
of the heat conduction process. 

It follows from the aforesaid that there is a qualitative dil- 
ference and lack of equivalence between work and heat as 
a form of energy transmission. Frequently, these two forms 
of transmitting energy exist simultaneously. For example, in 
heating a gas in a vessel with a moving piston, the volume of 
the gas is increased and work is done against the external pres- 
sure simultaneously. 

9.2.7 A thermodynamical system is said tu be closed (or isolated) 
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if there is no exchange of energy in any form between it and 
the external medium. Such systems comply with the law of 
conservalion of energy: the total energy of the system remains 
unchanged regardless of the processes that occur within it. 
9.2.8 A thermodynamical system is said to be adiabatically 
isolated (i.e. thermally isolated) if no heat exchange takes place 
between it and the external medium. Such a system can per- 
form work on external bodies. Morcover, external forces can 
do work on the system. An example of such a system is a cylin- 
der with a moving piston, filled with gas and surrounded all 
over by a dense layer of heatproof felt. The absence of heat 
exchange with the external medium does not exclude the possi- 
bility of the gas doing expansion work (9.2.3), or the external 
pressure doing compression work on the gas. 

The properties of a system approximate those of an adiabati- 
cally isolated one if the change in its state is so rapid that 
there is no time, during the process, for heat exchange with 
the external medium. (For example, rapid expansion of a com- 
pressed gas released from a gas cylinder when the outlet valve 
is opened for a short time.) 


9.3 First Law of Thermodynamics 


9.3.1 The first law of thermodynamics states that the change 
in the internal energy AU- of a system that occurs in the 
process 1 — 2 of transition of the system from state 1 to state 2 
is equal to the sum of the work Wi- done by external forces 
on the system and the quantity of heat Q,_, transmitted to the 
system: 


AUi- = Wi-2 + Qi-2; 


where Wi- = —W,_,, and Wi- is the work done by the 
system to overcome external forces in the process 1 — 2. Hence 


Qı-2 = A Uio ae Wy-2. 


The quantity of heat transferred to a system is expended in 
changing the internal energy of the system and in the work 
done by the system to overcome the externa] forces, 


0.4] GRAPHICAL REPRESENTATION {41 


For an element of the quantity of heat 6Q, an element of work 
SW and an infinitely small change dU of the internal energy, 
(he first law of thermodynamics assumes the form 


dV = dU 4- ôW. 

9.3.2 If Q > 0, then heat is being transferred to the system; 
if ÔQ < 0, then heat is being removed from the system. In the 
linite process 1 > 2, elements of the quantity of heat can have 
both signs, and the total quantity of heat Qi- in the process 


1 — 2 is equal to the algebraic sum of the quantities of heat 
lransferred during all the parts of this process: 


2 


Qi- = | dQ. 
1 

It is customary to consider 6W > 0 when the system performs 
work on external bodies, and 6W < 0 when work is performed 
by external bodies on the system. The work W,_, done by the 
system in the finite process 1 — 2 is equal to the algebraic sum 
of the amounts of work ôW performed by the system during 
all the parts of this process: 


2 
W 1-2 = \ ôW. 
1 


9.3.3 If the system for instance, the working medium (11.1.1) 
in a periodic- action engine completes a cycle 1— 1 (11.1.1), 
then AU,_; = 0 and W,_,; = Qi. It is impossible to build 
an operating engine that could perform more work than the 
energy transferred to the system from outside. Such an engine 
is called a perpetual motion machine of the first kind. The impos- 
sibility of constructing a perpetual motion engine of the first 
kind is also a statement of the first law of thermodynamics. 


9.4 Graphical Representation of Thermodynamic 
Processes and Work 
9.4.1 The equation of state (8.3.6) of a thermodynamic system 


enables the value of any state variable, for example p, to be 
determined from any values of two other variables, such as V 
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and 7’. Therefore, thermodynamic processes can be graphically 
represented in various two-dimensional coordinate systems. 
In addition to the most extensively used pV diagrams, pT 
and VT diagrams are also used. 

In Fig. 9.2, the thermodynamic process in the pV diagram 
is represented by the curve C,C,, and the points Cı (Pi, Vo) 
and Cs (ps, Va) indicate the initial and final states of the 
thermodynamic system. 

Only equilibrium processes (8.3.7) can be represented graphi- 
cally. State variables for the whole body (or system) cannot 
be specified for nonequilibrium processes (8.3.7) because they 
differ at different parts of the body (or system). Hence such 
graphical representation of nonequilibrium processes is im- 
possible. 

9.4.2 An element of work 6W done by a system (9.2.3) in an 
equilibrium process (8.3.7) can be measured by the area of the 


r" Fig. 9.2 


trapezoid having one curvilinear side, hatched in Fig. 9.2. 
The work W,- done by the system in the process CC, is equal to 


Vo 
Wi- = \ pdV 
Vi 


and is measured by the area that is bounded by the curve re- 
presenting process C,C,, the axis of abscissas and the ordinates 
pı and pz of points C; and Cy. The amount of work W,_, done 
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depends on how the system passes from state Cı to state Cs, 
i.c. upon the kind of process C,C, the system undergoes ih 
changing its state. In the pV diagram of Fig. 9.3 the amounts 
of work done by the system in the processés CyL,C,, CyL.C» 
ind C,L.C., and equal to Wri, W,,and W z,, respectively, 
are measured by three areas that differ in size: Wr, > Wy, > 
> W ts. , l 
After the system has completed the cycle (41.1.1) CiLiCaLC1 
the total amount of work Wg e,-¢, is not equal to zero. The 
positive expansion work in process C,L,C. exceeds the negative 
work done in the compression process CLC. The resultant 
positive work is measured by the hatched area in Fig. 9.3. 
9.4.3 The work W.and the quantity of heat Q are not state 
functions (8.3.8). In various processes 1— 2, in which the 
state of a system is changed, various quantities of heat are 
transferred to the system, and various amounts of work are 
done. Elements of the changes in the physical quantities 6Q 
and OW are not complete differentials (9.1.3). 


9.5 Heat Capacity of Matfer. Applying the First Law 
of Thermodynamics to Isoprocesses in an Ideal Gas 


9.5.1 The heat capacity C of a body is a physical quantity, 
numerically equal to the ratio of the quantity of heat òQ trans- 
lerred to the body to the change dT in the temperature of the 
body in the thermodynamic process being considered: 


,_ ô 
C= 7: 


The value of C depends upon the mass of the body, its chemical 
composition, thermodynamic state and the process in which 
(he heat 6Q is transferred to it. 

9.5.2 The specific heat c is the heat capacity per unit mass of 
a substance. For a homogeneous body c = C/M, where M is 
the mass of the body. 

The molar heat capacity C, is the heat capacity of a single mole 
(Appendix I) of the substance: C,, = wc, where p is the molar 
mass (8.4.3) of the substance. 
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9.5.3 The element of the quantity of heat 60 that must be 
transferred to a body to change its temperature from T to 


T+ AT is 
60 = C dT. 


For a homogeneous body 6Q = Mc dT = (M/u) Cy dT, where 
M is the mass of the body, u is its molar mass and M/w is the 
p number of moles in the body. 
2 9.5.4 For equilibrium (8.3.7) iso- 

processes in gases (8.3.7), the first 

law of thermodynamics (9.3.1) is 


7 of the form 
M 
~~ C, dT =dU +p av. 
3 u Ce F 
In the isochoric process (8.3.7) 
0 “—> of heating or cooling a gas (straight 
V lines 1-2 and 1-3 in Fig. 9.4), 
Fig 9.4 no element of work 6W = p dV 
eee is done (because dV = 0). The 


whole quantity of heat 6Q trans- 
ferred to the gas is expended in changing its internal ener- 
gy: ÔQ = dU. If Cyu is the molar heat capacity at constant 
volume, 


M 


Within a defnite temperature range it can be assumed that 
Cy x const (for greater detail see Sect. 10.7.4) and that the 
change in the internal energy of the gas AUj,_,, as its temper- 
ature is changed from 7, to T,,isequalto AUi- = U — U1 = 
= (M/u) Cy (Ta — Tı) and is due to the quantity of heat 
Q,-» transferred to the gas in the isochoric process: 


O11 = Cy, (T3 — T): (1) 


At T, > Ty, Qi- > 0 and a definite quantity of heat is trans- 
ferred to gas; at Ta < Ti, Qi- < 0 and a definite quantity 
of heat is conducted away from the gas. 
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9.5.5 For an ideal gas, equations (*) and (**) express the change 
in its internal energy in any process for changing the state of 
the gas in the temperature range (7,-7,). The internal energy 
of an ideal gas depends only 

on its chemical composition, 2 

mass and temperature. 

In real gases (12.1.2) the inter- 3 7 2 

nal energy includes the poten- 
lial energy of interaction be- 
tween the molecules '(9.1.2), 
which depends upon the dis- 
lance between them. This part 
of the internal energy of a g V y, Vy, vV 
rcal gas is changed with changes = 

in its volume. Therefore, 9.5 

equations (*) and (#*) express “S % 

the change in the internal 

energy of a real gas only when it is heated or cooled in an 
isochoric (constant-volume) process. 

9.5.6 In an arbitrary cquilibrium process (8.3.7) that takes 
place in an ideal gas, the first law of thermodynamics assumes 
the form 


M M 
p ae Cv dT -+-p dV, 


where C,, is the molar heat capacity of an ideal gas in the given 
process. The isobaric process (8.3.7) of heating (straight line 
1-2 in Fig. 9.5) or cooling (straight line 1-3 in Fig. 9.5) a gas 
can be carried out, for instance, in a vessel (cylinder) closed by 
a movable piston subject to a constant external pressure. An 
clement of work 6W done by an ideal gas in an isobaric process is 


SW =p ayes R aT, 
p 
: , ' M R 
in which use is made of the expression dV = — — dT from 


the Mendeleev-Clapeyron equation (8.4.4) at p = const. 
The universal gas constant R (8.4.4) is numerically equal 
to the work done by one mole of an ideal gas when it is heated 


10—0186 


146 FIRST LAW OF THERMODYNAMICS [9 


isobarically by one degree: 


R= ôW 
(M/u)dT ` 
The work W,_, done by the gas in isobaric expansion 14-2 is 
V2 
Wi-2 = \ p dV =p (Va— V1). 
V1 


It is represented by the hatched area in Fig. 9.5. For an ideal 
gas the work W,_, is also equal to 


M 
Wi-2 = w R (Ta— T1). 
9.5.7 The element of the quantity of heat 5Q transferred to 


the gas in the isobaric process is 


where Cp, is the molar heat capacity at constant pressure. 
If the quantity Cpu can be assumed constant in the temperature 


range (T-T), then the quantity of heat Qi- transferred to 
(or conducted away from) the gas is 


M 
oo aaa C pu (T,—T}). 


9.5.8 The molar heat capacities Cp, and Cy, of a gas are re- 
lated by the Mayer equation 

Cpu == Cyn = R. 

For specific heats cp and cy, the Mayer relation is of the form 
cp — cy = R/p, 


where p is the molar mass (8.4.3) of the gas. 
For the heat capacities Cp and Cy 


M 
l OER 
Pp V u , 
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where M is the mass of the gas and M/y is the number of moles 
it contains. 

‘The Mayer relation indicates that in heating a gas isobarically 
hy one degree a greater quantity of heat must be transferred 
lo the gas than in isochoric 

heating by the same amount. pt 

"This difference between the 
quantities of heat should be 
equal to the work done by the 
vas in its isobaric expansion. 

9.5.9 The isothermal process 
(8.3.7) of expanding (or com- 
ese) a gas takes place in 
icat exchange (9.2.4) between 
the gas and the external me- 
dium at a constant tempera- 
lure difference. For this to be 
leasible, the heat capacity 
(9.5.1) of the external medium Fig. 9.6 

should be sufficiently great and 

the process of expansion (or 

compression) should proceed extremely slowly. First-order 
phase transitions (12.3.3), such as boiling and condensa- 
lion, are isothermal processes if they proceed at constant ex- 
ternal pressure. 

In an isothermal process in an ideal gas the internal energy 
of the gas remains unchanged (9.5.5) and the whole quantity 
of heat Qi- transferred to the gas is expended in the work 
Wi- done by the gas to overcome the external forces: 


Y m ā PN M V 
Qi-2=Wy-2 = \ pdV = ae \ Top May: 
Vi V1 


llere M/u is the number of moles contained in mass M of the 
yas, T is the constant temperature of the gas, and V, and FV, 
ure the initial and final volumes of the gas. If the gas isother- 
mally expands (V, > V,), heat is being transferred to it 
(12 > 0) and it is doing positive work (W,_, > 0) represented 
hy the hatched area in Fig. 9.6. In isothermal compression of 
the gas (process 1-3 in Fig. 9.6), the work done by the gas is 
negative (W,_3 < 0). Positive work (Wi- = —W,_3; > 0) is 


10* 
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being done by the external forces to compress the gas. In com- 
pression, acertain quantity of heat is conducted away from the 
gas (Qi-3 < 0). The heat capacity of a substance in an isother- 
mal process equals infinity (dT = 0 and ôQ Æ 0). 

9.5.10 An adiabatic process (8.3.7) proceeds under the condi- 
tion that 6Q = 0. It is essential that the condition Q = 0 
does not define this process, because it does not specify the 
absence of heat exchange with the external medium. It only 
indicates that the algebraic sum of the quantities of heat trans- 
ferred to and conducted away from the gas in various parts 
of the process equals zero. In an adiabatic process, work is 
done by an ideal gas at the expense of its internal energy: 


eee; eeemacick 


Cyu aT, 


where Cy, is the molar heat capacity of the gas at constant 
volume, M/u is the number of moles contained in mass M of 
the gas, and dT is an element of change in the temperature 
of the gas. If the gas adiabatically expands (ôW = p dV > 0), 
it is cooled (dT <0). In adiabatic compression the gas is 
heated: ôW = pdV < 0 and dT > 0. 


9.5.11 The adiabatic, or Poisson, equation: pV“ = const, is 

valid for an equilibrium adiabatic process (8.3.7). 

Making use of the Mendeleev-Clapeyron equation (8.4.4), the 

relation between p and T, and also between V and T, in an 

adiabatic process can be found from the adiabatic equation: 
x 1 


pT”-'!=const and VT*T! =const. 


In these equations the dimensionless quantity x = Cp,/Cyy 
=cpley >1 is the adiabatic exponent. The full curve in 
Fig. 9.7, called an adiabat or adiabatic curve, represents an 
adiabatic process in a pV diagram. The dashed curve on this 
diagram, called an isotherm, represents an isothermic process 
at the temperature corresponding to the initial state 7 of the 
gas. In an adiabatic process the pressure varies with the volume 
more rapidly than in an isothermic process (the curve is steep- 
er). In adiabatic expansion the temperature of the gas and its 
pressure decrease faster than in the corresponding isothermal 
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expansion. In adiabatic compression of the gas its pressure 
increases at a higher rate than in the corresponding isothermal 
compression. The reason is that the increased pressure is due 
both to the decrease in the volume of the gas and to the increase 
in its temperature. 

9.5.12 The work W,_, done by the gas in the adiabatic process 
1 — 2 is measured by the hatched area in Fig. 9.7. 


Fig. 9.7 


‘The amount of work Wy,_, done in an adiabatic process is ex- 
pressed by 
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Wi- = ao (Ti— Tə), Wa ae ki Va; 
= PWVi _ =) —_ PiVy | -(7)"""] 
Wi- EN “%—1 (1 To ’ Wi- = x—1 À Va s 


In these equations M/u is the number of moles contained in 
mass M of the gas, Cy, is the molar heat capacity at constant 
volume, x is the adiabatic exponent, and p, V and T are the 
state variables of the gas in the states 7 and 2 as indicated. 
9.5.13 Table 9.1 lists summarized information on the isopro- 
cesses in gases, 
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CHAPTER 10 KINETIC THEORY OF GASES 


10.1 Certain Information on Classical 
Statistical Physics 


10.1.1 The kinetic theory of gases deals with the structure and 
physical properties of gases and is based on the statistical 
method of investigation (8.2.2). Besides those mentioned in 
Sect. (8.2.1), the following propositions form the foundation 
for classical statistical physics. 

10.1.2 A system of particles complies with the laws of conser- 
vation of energy (9.2.7), momentum (2.7.1) and of angular 
momentum (4.4.1). A system of charged particles complies 
with the law of conservation of electric charge (14.1.3). 
10.1.3 All physical processes in a system of particles proceed 
continuously in space and time. A space-time description of any 
physical phenomena in classical mechanics and classical sta- 
tistical physics assumes the possibility of continuous variation 
of all physical quantities that characterize the state of the 
system. For instance, the velocity and energy of any particle 
can vary continuously due to the action of various forces. 
10.1.4 Each particle in a system is regarded as being “tagged”. 
Tt can be distinguished from all other such particles (distin- 
guishability of identical particles in classical statistical physics). 
10.1.5 Any particle of a system can have arbitrary values for 
its coordinates and momenta (or velocities) independently 
of the values of these quantities for other particles of the system. 
If an infinitely small volume dz dy dz is chosen in the coor- 
dinate system XYZ, any particle can be located within this 
volume regardless of the presence in this volume of any arbi- 
(rary number of other particles. Likewise, any particle may 
be located within an arbitrary element of “volume” dp, dp, dp, 
of momentum space (or duy du, du, of velocity space) regardless 
of the presence in these “volumes” of any arbitrary number of 
other particles. This implies that any particle can have momen- 
(um components along the coordinate axes within the limits 
from py to py + dpy, from py to py + dpy and from p, to 
Pz + dp, (and, respectively, velocity components of the par- 
ticle along the coordinate axes within the limits from uy to 
Uy + duy, Uy to uy + du, and u, to u, t- du,). The minimum 
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values of the volumes dz dy dz and dp, dp, dp, are not restricted 
in any manner. 

Sections 10.1.2 and 10.1.3 apply not only to classical statistical 
physics, they are characteristic of all of classical physics. 
Sections 10.1.4 and 10.1.5 apply only to classical statistical 
physics; they do not hold for quantum statistics (41.1.1). 


10.2 Basic Equation of the Kinetic Theory of Gases 


10.2.1 The pressure of a gas (8.3.2) in a vessel is the result 
of the collisions of the gas molecules with the walls of the 
vessel. The pressure of a gas is the macroscopic effect of the 
thermal motion of molecules (8.1.1). As the molecules of the 
gas strike the walls of the vessel (8.4.1) they impart their 
momentum (2.3.4) to the walls. This change in momentum of 
the molecules exerts the pressure on the walls. For an ideal 
gas the collisions between the molecules within the vessel has 
no effect on the pressure exerted by the gas on the walls. Owing 
to the chaotic nature of thermal motion of the molecules (8.1.3) 
the pressure of the gas is the same on all the walls and, by 
definition (8.3.2), is the average force exerted normal to unit 
area of the wall surface. 

10.2.2 The basic equation of the kinetic theory of gases is 

2 


pV = 3 Fx, 


where p is the pressure of the gas, V is its volume and 


mu? 
Eg = > 5 is the total kinetic energy of translational 


i=! 
motion of all the N molecules of gas in the vessel. 
10.2.3 In a homogeneous gas m; = m, i.e. the mass of each 
molecule is the same, but their velocities u; differ (10.3.1), 
and 
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It proves expedient to introduce the root-mean-square velocity 
Vrms Of translational motion of gases: 


\ 
A N > uz, 


i=1 


where N is the total number of molecules in volume V of the 
gas. The root-mean-square velocity pertains to the whole 


<E 
7 
7 
| 
g T 
Fig. 10.4 


population of molecules, but is meaningless in application 
to a single molecule or to a small number of molecules. Making 
use of vV,ms the expression for Ey, assumes the form 


and n E N mv? ns = 


1 
Ek = Nmv? 3 


rms 


where M = Nm is the mass of the gas. 
The basic equation for the pressure of a gas is 


2 1 1 


= MENA E NE a8 
P =a Ba 3 nomlrms = 3 PUrms? 


where Epo = E,/V, no = N/V is the number of molecules in 
unit volume, and p = ngm is the density of the gas, 
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10.2.4 From a comparison of the Mendeleev-Clapeyron equation 
(8.4.4) with the basic equation (10.2.3) RT = pv? ,/3 it follows 


that 
p 3 RT =y 3 RT =y 3kT = — 
wm= = mN s A V pv, 


where k is Boltzmann’s constant (8.4.5), m is the mass of a mol- 
ecule, Na, is Avogadro’s number (Appendix II), p is the pres- 
sure of the gas, and v is its specific volume. 

10.2.5 The average kinetic energy of translational motion of the 
molecules in an ideal gas is 


E mv? m 3 
my, __ k __ rms __ 
ye ng rer 


The average kinetic energy (Ek) is proportional to the abso- 
lute temperature and is independent of any other variables 
(Fig. 10.1). The absolute, or thermodynamic, temperature is 
a measure of the average kinetic energy of translational motion 
of the molecules in an ideal gas. This consequence of the kinetic 
theory of gases is not valid in the region of ultralow tempera- 
tures, close to absolute zero (8.3.4). 


10.3 Maxwell's Molecular Velocity and Energy 
Distribution Law {Maxwell's Distribution Law] 


10.3.1 The law of the distribution of molecular velocities for an 
ideal gas, theoretically established by Maxwell, determines the 
number dn of molecules of a homogeneous monatomic ideal 
gas, out of the total number n, of its molecules in unit volume, 
that has a velocity, at a given temperature, lying within the 
range from u to u -+ du. This law is valid for gases in a state 
of thermodynamic equilibrium (8.3.3). The velocity distribu- 
tion of the molecules of such a gas is a steady-state value. 
The Maxwellian distribution is based on pairwise collisions of 
the randomly moving gas molecules. This assumes uniform 
distribution of the molecules throughout the volume of the 
vessel, i.e. constant density of the gas, 
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10.3.2 In its most customary form, the law of the distribution 
of molecular speeds (magnitudes of the molecular velocities) 
is presented as 


mu2 


le a iia 4nu? du 


dn=ne (sar 


Here u is the magnitude of the molecular velocity (speed of 
a molecule), m is the mass of a molecule, k is Boltzmann’s 


an 
du 


<N 
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Fig. 10.2 


constant, and 7 is the absolute (thermodynamic) temperature. 
Molecular velocity distribution curves for various temperatures 
Ti < T < T, are shown in Fig. 10.2. As is evident from these 
curves, the most probable velocity up (10.3.4) increases with 
temperature and the fraction of molecules travelling at this 
velocity decreases. 

10.3.3 Another form of the Maxwellian distribution is 


mu? 
m yn" ~ ORT 


dn = No ( OnkT duy duy duz, 


where ux, uy and u, are projections of the molecular velocity 
on the coordinate axes. Owing to the random nature of thermal 
motion of the molecules (8.1.1), distributions of the molecules 
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according to the projections u; of their velocities on the coor- 
dinate axes (i = z, y, z) are mutually independent. Hence 


dn = nof (Ux) f (Uy) f (uz) duy du, du,, 


where 
2 


m Dh Ser ots 
f (ui) = (=r } e *kT (i= x, y, 2) 


is a distribution function of the projections of the molecular 
velocities. 

The Maxwellian distribution is isotropic. This is evident from 
the fact that the function f (ux, uy, uz) depends only upon 
the speed (magnitude of the velocity), and because the function 
f (u;) is the same along all the axes. 

10.3.4 The law of the distribution of molecular velocities 
(10.3.2) can be used to determine the most probable velocity 
Up of the molecules, which corresponds to the maximum of the 
function 


m 
m eo ORT 
AnkT i 
mu2 


The condition |-> e ne u?) | =0Q0 yields 
du U=Up 


2kT 2RT 2 
by = aa os u Vrms 3° 


10.3.5 A third form of the law of the distribution of molecular 
velocities is 


d 
f (u) = — = 4atng ( 


an Ano owl ( u J du 


Vn up } up 


up g 
The fraction dn/n of the molecules of a gas having velocities 
lying within the range from u to u + dy is numerically equa] 
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to the hatched area dS under the curve in Fig. 10.3, which 
represents the dependence of (up/n ) (dn/du) on u/up: 


u 
as = ( p =) i ee 
up No 


The area under the curve in Fig. 10.3, i.e. bounded by the 
curve and the axis of abscissas, is equal to unity. This area 


Fig: 10.3 


represents the fraction of the molecules having all possible 
velocity values from 0 to oo. 
10.3.6 The arithmetic mean velocity (u) of translational motion 
of the molecules in an ideal gas, calculated by means of the 
distribution law (10.3.2), is 


8g SRT SkT = 
(4) = Vrms V eV am =y a a 


10.3.7 The energy distribution of the molecules in an ideal 
gas determines the fraction dn mno of molecules, of their 
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total number ny, having kinetic energies EẸ = mu?/2 lying 
within the range from E to Ep + dE,’: 


Ek 
2n >. Pr VY pn 
dn am =e (ery a e RT V EM dED, 
Here dn jm! no = f (Ej) dE% , where f (Ej) is the distribution 
function for the molecular energies in an ideal gas. 


Example. The average kinetic energy (Eķ ) of a molecule in 
an ideal gas is 


oo 


i ess 
EMe "! V EM gem 


e 2 
Em oe \ E Be dE™ —— \ 5 


3 
oF kT 


(cf. Sect. 10.2.4). 

10.3.8 The relative motion (7.1.2) of two particles with the 
masses m, and m, is equivalent to the motion of a single particle 
with the reduced mass m,eq = m,m,/(m, + ma). For a homo- 
geneous gas my = M = m and myeqg = m/2. The distribution 
of molecules according to their relative velocities establishes 
the fraction dny,,)/n) of molecules, of their total number no, 


having relative velocities u,.) lying within the range from 
Urel tO Urel + dUpel: 
2 
murel 
m ~ o RT 


Anup) = No (=r 4nu? 


re] frel. 


Here dnupej/ no = f (Urel) durei, Where 
mufel 
m Bf A> Sa 
f (ure) = 40 (azar) e ART Uko] is the relative velocity 


distribution of the molecules in an ideal gas. 
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Example. The average relative velocity (ure}) of the mole- 
cules is : 


(Urel) = \ Urelf (Urel) dure] = V2 a y2 (u), 
0 


where (u is the arithmetic mean velocity of the molecules 
(10.3.6). 


10.4 Particle Distribution in a Potential Force Field 
(Boltzmann Distribution) 


10.4.1 Gases are ordinarily in the potential (conservative) 
gravitational fields of the earth (6.2.1). If there were no such 
gravitational field, the atmospheric air would be dispersed 
in the universe. If, on the other hand, there were no thermal 
motion, the molecules of atmospheric air would drop to the 
earth. Gravity and thermal motion lead to a steady state of 
gases, in which their concentration and pressure decrease with 
an increase of their altitude above the earth’s surface. 

10.4.2 If a gas (or other system of particles) is located in an 
external potential (conservative) force field (3.1.6), the distri- 
bution of the particles throughout the volume is described by 
the Bolizmann distribution law. This law establishes the number 
of particles dn (z, y, z) that have coordinates within the ranges 
from x to x + dz, from y to y + dy, and from z to z+ dz. 
In other words, dn (x, y, 2) is the number of molecules contained 
in the element of volume dV = dz dy dz. The Boltzmann 
distribution law is of the form 


Ey (x) Ys 2) 
dn (x, y, z)=const x e RE dx dy dz 


where E5 (x, y, z) is the potential energy of a particle in the 


external force field, k is Boltzmann’s constant, and T is the 
absolute temperature. The value of the constant in the preceding 
equation is determined from the normalization condition: 


dn= no, Where nọ is the total number of particles in unit 


volume. 
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Example. For particles of mass m, located in the earth’s gravi- 

tational field, ED = mgh (3.3.3), where g is the acceleration 

due to gravity (7.3.3), and h is the altitude. At each altitude 

there is a Maxwellian molecular velocity distribution (10.3.1). 

The number of molecules contained in the volume dV is 
mgh 


aet 


dn (x, y, z2)=const Xx e RT ay. 


The gas density ọ = m dn/dV decreases with an increase in the 
altitude according to the exponential law 


mgh 
RT 


p= const X e 


The value of the constant can be determined from the condition 
= 0, = const at h = 0. The density of the gas or its pressure 
varies according to the barometric height formula 


mgh B mgh 
p=pọẹ "T and p=pe "T. 


10.4.3 The Mazwell-Bolizmann law, or distribution, describes 
the distribution of the coordinates and velocities of gas mole- 
cules subject to the action of an arbitrary potential (conserva- 
tive) force field: 


c m 
P+tPy+p? Ep (Xs Y, 2) 
2mkT 


dn =const X e dp» dpy dpe ae dx dy dz, 
where dn is the number of molecules contained in six-dimensio- 
nal space within the element of volume dl! =dz dy dz dp, dp, dPz, 


and E™ (z, y, z) is the potential energy of a molecule in the 


external force field at a point with the coordinates z, y and z, 
and the projections py, py and p, of the momenta on the axes. 
The Maxwell-Boltzmann law is the product of two distribution 
fuactions. One of them describes the distribution with respect 
to coordinates, and the other with respect to momenta (or 
velocities). 


10.5) MEAN FREE PATH OF MOLECULES 164 


10.5 Mean Free Path of Molecules 


10.5.1 Gas molecules have finite dimensions (8.4.1) and are 
continuously colliding with one another in thermal motion. 
Between two successive collisions, molecules travel at uniform 
velocity in a straight line over certain distances called their 
free paths à. The mean free path (i) is the average distance that 
a molecule travels without a collision. The mean free path is 
a characteristic of the whole totality of gas molecules at given 
values of p and T. 

10.5.2 In unit time each molecule undergoes a mean number 
of collisions (z) (average collision frequency) equal to 


(2) = nd?no (ure) = V 2 1d? ng (u). 


Here d is the effective diameter of the molecule (8.4.1), no 
is the number of molecules in unit volume of the gas, (upel) 
is their average relative velocity (10.3.8), and (uw) is their 
arithmetic mean velocity (10.3.6). 

10.5.3 The average distance travelled by a molecule in unit 
time is numerically equal to (u). Therefore (uw) = (A) (z). 
The mean free path (A) is 


et) ere 
ac ae ree 


At constant temperature, ng is proportional to the pressure p 
of the gas (8.4.5). Hence, for a given gas the mean free path 
is inversely proportional to the pressure: 


Py (Ay) = Pe (Ag) = Const, 


in which the subscripts 1 and 2 refer to two states of the gas. 
10.5.4 If molecules are ejected from some source of particles 
and a beam is formed by means of a diaphragm, the free path 
distribution law is valid for the molecules of the beam: 


N= Noel). 


where N is the number of molecules in the beam that have 
travelled a distance z without collisions, and N, is the number 
of molecules in the beam at x = 0, i.e. at the output of the 
diaphragm. 
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10.6 Principle of the Equipartition of Energy. 
Infernal Energy of an Ideal Gas 


10.6.1 The number of degrees of freedom of a body is the minimum 
number of coordinates (number of independent coordinates) 
that it is necessary to specify in order to completely determine 
the position of the body in space. 
A particle, for instance, travelling along one of the coordinate 
axes has a single degree of freedom. The same particle travelling 
in a plane has two degrees of 
freedom. The position of a par- 
ticle travelling freely in space 
is determined by three degrees 
of freedom: the coordinates z, 
y and z. A rigid body (1.1.4) 
has six degrees of freedom: its 
position in space is determined 
by the three coordinates of its 
centre of mass (2.3.3), two an- 
gles, 0 and qg, determining the 
direction of a certain axis, 
Fig. 10.4 fixed in the body and passing 
through its centre of mass 
(Fig. 10.4), and, in addition, 
angle wp, determining the direction ofa second axis, fixed 
in the body and perpendicular to the first axis. Changes 
in the three coordinates of the centre of mass at given 
constant angles 0, @ and » correspond to translational motion 
of the rigid body. The coordinates of the centre of mass are 
the three degrees of freedom of translational motion. Changes 
in the angle 0, ~ or w at a constant position of the centre of 
mass lead to rotation of the rigid body. Hence, the correspond- 
ing degrees of freedom are said to be rotary. Additional degrees 
of freedom of oscillatory, or vibratory, motion (10.6.5) are 
introduced to determine the position in space of an imperfectly 
rigid body, whose various parts can be displaced with respect 
to one another. 
10.6.2 In many problems a molecule of a monatomic gas can 
be regarded as a particle (cf Sect. 8.4.1). This is based on the 
fact that practically all the mass of such an atom is concentrated 
in its nucleus (42.1.1), which is of extremely small size. The 
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molecule of monatomic gas has three degrees of freedom of 
translational motion. Its average kinetic energy (Ej) is equal 


to the kinetic energy of molecules travelling at a velocity 
equal to the root-mean-square velocity vpms (10.2.4): 


3 


(EB) =< kT. 


Owing to the random nature of thermal motion of the mole- 
cules, this energy is uniformly distributed between the three 
degrees of freedom. Hence, on an average each degree of free- 


Fig. 10.5 Fig.410.6 


dom of translational motion of a monatomic molecule accounts 


for the same amount of kinetic energy (Z;,) = (Ey')/3 = 


-= kT/2, where k is Boltzmann’s constant and T is the absolute 
temperature. 

10.6.3 As a first approximation, a molecule of a diatomic gas 
consists of two atoms, A and B, rigidly held together by an 
incompressible, undeformable bond (Fig. 10.5). In addition 
to the three degrees of freedom of translational motion, the 
centre of mass C (2.3.3) of such a molecule has two more degrees 
of freedom of rotary motion about axes O,-O, and O,-O,. Rota- 
lion about the third axis O’-O’ makes no contribution to the 
molecule’s energy because the moment of inertia (4.2.1) of 
the atoms with respect to this axis is negligibly small. Thus, 
a diatomic molecule has five degrees of freedom. Molecules 


11 * 


164 7 KINETIC THEORY OF GASES [10 


consisting of three or more atoms have six degrees of freedom: 
three translational and three rotary degrecs (Fig. 10.6). 
10.6.4 The principle of equipartition of energy states that 
on an average each degree of freedom of a molecule accounts 
for the same amount of kinetic energy, equal to kT/2.I ia mol- 
ecule has i degrees of freedom its average kinetic energy 
(Ee y= ah Ti2, 

The principle of equipartition of energy leads to the conclusion 
that all degrees of freedom of the molecule are equally justified: 
each makes the same contribution to its average energy. This 
conclusion actually has only a limited field of application, and 
has been revised for quantum statistics (41.1.1). 

10.6.5 In real molecules, which are not absolutely rigid, vibra- 
tory degrees of freedom make a contribution to the energy of 
the molecule in accordance with the principle of equipartition 
of energy (10.6.4). Each vibratory degree of freedom accounts, 


not only for kinetic energy £}', but for potential energy Ep 


as well. For harmonic vibrations (28.1.3), (Ey) = (Ep) 
According to Sect. 10.6.4 


1 
my TN nw 
(Eko =U A 9 kT, 
where (Ek) and (Ep) are the average kinetic and potential 
energies of a molecule per vibratory degree of freedom. 


The average value (£}') of the total energy of a molecule per 
vibratory degree of freedom is 


(ED) = (ER) + (ER) = 2 (Ei) = kT. 


This equation is valid for harmonic vibrations of particles 
(atoms, molecules or ions) in the crystal lattice sites of sol- 
ids (8.1.4). 

10.6.6 The internal energy (9.1.2) of a polyatomic gas consti- 
tutes the kinetic energy of all kinds of motion of its particles. 
For one mole of such a gas 


. i _ RT 
U=(E\’) Na=—z kN aT =i = — , 
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where Na is Avogadro’s number (Appendix II), i is the number 
of degrees of freedom of the molecule (10.6.4), and R is the 
universal gas constant (8.4.4). 


10.7 Heat Capacity of Monafomic, Diatomic and 
Polyatomic Gases 


10.7.4 The molecules of a monatomic gas have three degrees 
of freedom (10.6.2) and, according to Sect. 10.6.6, the inter- 
nal energy of one mole of such a gas is U = 3kN,T/2. 
‘The molar heat capacity of such a gas at constant volume (9.5.4) 
is 


| dU 3 3 J joules 
T = Z — =x — R — 4 hs a E A 
C vy aT 2 Nak 2 ae ork | mole X kelvin 
cal cal 
x3 “moLkK. because Ra2 aol. P 
For a polyatomic gas whose molecules have i degrees of freedom 
i iR . I `. cal 
aa Bia eT eT’ 


10.7.2 The molar heat capacity Cp, at constant pressure (9.5.7 
and 9.5.8) of a gas whose molecules have i degrees of freedom is 


r G--2)R J ee cal 

Opn = 2 ROAR) mol-K AA mol-K ° 
The principle of equipartition of energy (10.6.4) leads to the 
conclusion that the heat capacity of a gas depends upon the 
number of degrees of freedom of its molecules, but does not 
depend on the temperature. Experimental data disprove these 
conclusions of the classical theory of heat capacity. Investiga- 
lions indicate that the heat capacity of gases increases and 
decreases with temperature. 

10.7.3 A theoretical explanation of experimental data on the 
dependence of heat capacity on temperature in a wide tem- 
perature range is given in the quantum theory of heat capacity. 
This theory takes into account the following results obtained 
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in a gquantum-mechanical description of the properties of 
diatomic and polyatomic molecules: 

(a) Of the four parts that comprise the energy E™ of the mole- 
cule* (39.5.2), i.e. 


Em = EM 1 AM4 pM AM, 


only EY, the energy of translational motion of the molecule’s 


centre of mass (2.3.3), varies continuously (not by discrete 
amounts). The values of all other kinds of internal energy 
of molecules are quantized and thereby assume only a discrete 
series of values (39.4.5). This refers to the following kinds of 


energy: Ep, the energy of motion of the electrons of the mole- 
cule; EY’, the energy of vibrational motion of the nuclei in the 


molecule; and EP, the energy of rotary motion of the molecule. 


(b) As a first approximation, the kinds of internal motion 
of the molecule can be assumed independent of one another. 
Thus, at small amplitudes of vibration of the nuclei, the changes 
in the moments of inertia (4.2.1) of the molecule due to vibra- 
tion can be neglected, as can the effect of the vibrational motion 
in the molecule on its rotation. 

(c) No changes in the energy of motion of the electrons occur 
when a substance is heated by one degree at ordinary tempera- 
tures. Adjacent energy levels of the electrons in the molecule 
are separated by energy of the order of several eV, correspond- 
ing to temperatures of several tens of thousands of degrees 
(10.7.6). Therefore, the energy of motion of electrons in the 
molecule can be neglected in dealing with the heat capacities 
of monatomic and polyatomic gases up to the very highest 
temperatures. 

10.7.4 As a first approximation, the vibrational motion of 
the nuclei in the molecule is described as the vibration of 
a harmonic oscillator (38.5.4), whose mass m is equal to the 
reduced mass of a system of vibrating atoms, m = mm,/(m, + 


* This ignores the nuclear energy of the molecule, which has 
no influence on the heat capacity of molecules, 
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-+ ma), where m, and m, are the masses of the atoms. Then the 
energy of vibrational motion of the molecule equals 


1 
ER =hvo (n 4 =) l 


where n is the quantum number and assumes integral values: 
n = 0, 1, 2,..., Vo is the natural frequency of vibrations 
(38.5.5), and h is Planck’s constant (Appendix II). The energy 
of a harmonic oscillator, equal to hv,/2 at n = 0, is conserved 
at any arbitrarily low temperatures, and even at T —> 0 (38.5.6). 


The difference AE, between the energies of two adjacent vibra- 


tional energy levels, AET = hvg, is independent of the quantum 
number. Valid at room and lower temperatures is the condi- 
tion: AE” >> kT. No changes are observed in the energy of 


vibrational motion of molecules at such temperatures. In cal- 
culating the heat capacity of gases under such conditions, 
vibrational degrees of freedom of the molecule can be neglected. 
In the general case, vibrational motions make contributions 
to the internal energy Uy (9.1.2) and the heat capacity Cyu 
(9.5.2). For one mole 


Sg OV 
jea Nahy ( or awe a ) 
, Tig 7 hy hy ’ 
(IT e QRT ) 
AV aN 
Nak hy 2( e 2hT Sep 2kT 
Cvm =} (+r) 7 : 


where T is the absolute temperature, k is Boltzmann’s constant 
(8.4.5) and Na is Avogadro’s number (Appendix II). At high 
temperatures (T > hv/k) 


Uy = Na kT = RT, Cyyy > Nak =R. 


These results coincide with those of classical theory which 
follow from the principle of the equipartition of energy (10.6.4). 
Under these conditions the energy of the oscillator can be 
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assumed to vary continuously because AE < kT. At suf- 
ficiently low temperatures (T < hv/k) 


hv 
h PEAS 
Uy = AA 4 Nahve ia , and 
OE oe 
V RT 
Cvuy X Nak (=) e . 


As T +0, the energy Uy > N ahv/2. This quantity is called 
the zero-point vibrational energy of the system (38.5.6). As 
T —> 0, the heat capacity Cyu, —> 
.-> 0 in accordance with the third 
- law of thermodynamics (11.8.4). 
The dependence Cyu, = f (T) for 


diatomic gases is shown in Fig. 
10.7. 
10.7.5 As a first approximation, 
7 the rotary motion of diatomic 
molecules can be dealt with as 
the motion of a rigid dumbbell 
which rotates about its centre 
of mass with the moment of 
inertia J = m,mgr3/(m, + mə), where m = mıma/ (m; + mo) is 
the reduced mass of the molecule (m, and m, are the masses 
of the two atoms), and r, is the distance between the atoms 
in the molecule. The energy of such a system is 


Cyu U 


Fig. 10.7 


h2 

m _ 
E = Srl J (J+), 
where J is a quantum number that assumes the values J = 
= 0,1, 2,.... The difference AEP between the energies 
of two adjacent rotational energy levels is 

2h2 
m 

AER = r (T1) 


The value of AEP is less than AE," by a factor of 800 to 1000. 
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At ordinary temperatures AE) < kT for diatomic and certain 


polyatomic gases (water vapour, methane, etc.), and the quan- 
tization of the energy of rotary motion of the molecules can 
be ignored. Under these conditions, the principle of equiparti- 
tion of energy (10.6.4) can be applied in calculating the contri- 
bution of rotary motion to the heat capacity. 

The contribution of rotary motion of diatomic molecules to 
the internal energy and heat capacity per mole equals: 

(a) At high temperatures (T >> h?/8n?k/) 


h2 
2402 TkT 
The heat capacity Cyr at high temperatures has a value fol- 


lowing from the principle of equipartition of energy (10.6.4). 
(b) At low temperatures (T < h?/8n2k/) 


Up x NakT (4 ) , and Cypr œ Nak=R. 


h2 
Sh2N A  4n2IrT 
r ay , and 
h2 
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as T — 0, the heat capacity Cyy,— 0 (10.7.4). The general 
shape of the curve representing the dependence Cy,,, = f (T) 
is the same as that in Fig. 10.7. 

As the gas cools, the reduction in the energy of its molecules 
leads to a reduction in the number of molecules transferring 
to higher rotational energy levels. At sufficiently low tem- 
peratures, the rotation of the molecules in a gas practically 
cannot be excited, and rotational degrees of freedom make 
no contribution to the heat capacity of the gas. The heat capa- 
cities of all gases at low temperatures become the same as for 
monatomic gases whose molecules have no rotational degrees 
of freedom. 

10.7.6 At extremely high temperatures, of the order of tens 
of thousands of degrees, the heat capacity is increased. This is 
associated with the fact that the value of kT under such condi- 


tions becomes comparable, in order of magnitude, with AE, 
the change in the energy of the electrons in their transition 
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from lower to higher energy levels. Moreover, at high tem- 
peratures the processes of dissociation and ionization of the 
gases make contributions to their heat capacities. 


10.8 Transport Phenomena in Gases 


10.8.1 Transport phenomena unite a group of processes pertain- 
ing to nonuniformities in the density, temperature or velocity 
of ordered motion of various layers of a substance. The equali- 
zation of these nonuniformities leads to transport phenomena. 
Transport phenomena include diffusion, internal friction and 
heat conduction. 
10.8.2 Transport P in gases and liquids consist of 
ordered directional mass transport in these substances (diffu- 
sion), momentum transport (internal friction) and internal 
energy transport aea conduction). In gases these phenomena 
violate the completely random motion of the molecules and 
Maxwell’s molecular velocity distribution (10.3.2). Deviations 
from the Maxwell law can be explained by the directional 
transport of the physical characteristics of the substance in 
such phenomena. In the simplest cases of one-dimensional 
transport phenomena, the physical quantities that govern 
them depend on only a single Cartesian coordinate. 
10.8.3 Diffusion is the spontaneous interpenetration and mixing 
of the particles of two adjoining gases, liquids or solids. In 
chemically pure gases at constant temperature, diffusion occurs 
owing to differing densities in various parts throughout the 
volume òf the gas. In a mixture of gases, diffusion is caused 
by the difference in the densities of the various gases at various 
parts of the volume of the gas mixture. 
In a chemically homogeneous gas the phenomenon of diffusion 
consists in the transport of a mass of the gas from places with 
higher density to places with lower density. This phenomenon 
obeys Fick’s law: 

p 2L 


Msec = — dr ° 


Here msec is the specific mass flow and is numerically equal 
to the mass that passes by diffusion in unit time through a plane 
‘surface of unit area located perpendicular to the direction the 
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substance is being transported, p is the density of the gas, 
and D is the diffusion coefficient. The derivative do/dz is numer- 
ically equal to the density gradient, i.e. the change in density 
per unit length. The diffusion coefficient is numerically equal 
to the specific mass flow at unit density gradient. The minus 
sign in Fick’s law indicates that mass transport takes place 
in the direction of decreasing density. 

Another form of Fick’s law is 


dno 


ei ee ee 


where j = msec/m is the density of the molecular flow in dif- 
fusion, i.e. the number of molecules diffusing in unit time 
through the surface indicated in the first form of Fick’s law, 
ny is the molecular concentration, equal to the number of 
molecules per unit volume, and m is the mass of one molecule, 
so that p = nom. 

The expression obtained in the kinetic theory of gases for the 
diffusion coefficient is 


D=-5 (u) (h), 


where (uw) is the arithmetic mean velocity of thermal motion 
of the molecules (10.3.6), and (A) is the mean free path (10.5.1). 
10.8.4 Internal friction (viscosity) is the occurrence of friction 
forces between layers of a gas or liquid that flow parallel to 
one another at different speeds. A layer flowing faster exerts 
an accelerating force on a slower layer. On the other hand, 
a slow layer retards a faster flowing layer of fluid. The forces 
of internal friction (viscosity) are directed tangent to the 
contacting surfaces of the layers. The cause of viscosity (as 
internal friction in gases and liquids is more commonly called) 
is the superposition of ordered motion of layers of fluid at 
various velocities v and the thermal random motion of the 
molecules at velocities depending upon the temperature. The 
random motion of the molecules transfers them from layer B 
flowing at the velocity v, into layer A flowing at the velocity v, 
(Fig. 10.8). This leads to transport of momenta mv of ordered 
motion of the molecules. When v, > və, the molecules that 
entered layer A from layer B have their velocities of ordered 
motion accelerated by collisions with the molecules from the 
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new layer. This slows down the velocities of ordered motion 
of the molecules in layer A. On the other hand, molecules 
passing from faster layer A to slower layer B have higher mo- 
menta mv, so that the inter- 
molecular collisions in layer B 


A U, accelerate the motion of its mol- 
The phenomenon of viscosity is 
f > y described by the Newtonian fric- 

dv 


Fig. 10.8 
where t is the friction stress, i.e. 

, the physical quantity equal nu- 
merically to the force of internal friction acting per unit 
area of the surface of the layer, dv/dn is the velocity gradient, 
i.e. the variation in the velocity of layer motion per unit length 
in the direction of the inner normal n to the layer surface. 
The internal frictional force is opposite in direction to the 
derivative of the gas velocity vector v with respect to n. 
The quantity ņ is called the coefficient of dynamic viscosity 
(or just the viscosity). It is numerically equal to the friction 
stress at a velocity gradient equal to unity. 
The coefficient of kinematic viscosity is the quantity v = n/p, 
where p is the density of the substance. 
o coefficient of dynamic viscosity is calculated by the equa- 
ion 


n = -5 (u) (A) p, 


where (uw) is the arithmetic mean velocity of thermal motion 
of the molecules (10.3.6), and (A) is their mean free path 
(10.5.1). The coefficient of dynamic viscosity is independent 
of the pressure (or density) of the gas since (A) œ 1/p. This 
can be explained by the fact that in an isothermal increase 
in the density of a gas, for example, by two times, the number 
of carriers of momenta is doubled, but each molecule (or atom) 
travels only one half the distance without a collision and carries 
only one half as much momentum. Hence, all in all, momenta 
transport remains unchanged, 
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10.8.5 The third transport phenomenon, heat conduction, occurs 
under the condition that a difference in temperature exists 
in a body in a certain direction. For instance, the two opposing 
walls of a vessel holding a gas can have different temperatures 
that are maintained by external sources. Then the gas molecules 
at various locations throughout the volume have different 
average kinetic energies (10.2.4). Under such conditions, the 
random thermal motion of the molecules leads to directional 
energy transport (or transfer) in the form of heat (9.2.6). Mol- 
ecules passing from the heated parts of the volume of gas to 
the colder parts yield some of their average kinetic energy 
to the surrounding molecules in the process of molecular colli- 
sions. Slow molecules, on the contrary, increase their average 
kinetic energy, in passing from the less heated to the more heated 
parts of the gas, owing to collisions with molecules having 
higher velocities. 

ın one-dimensional heat conduction, when the temperature 
of the gas depends only on a single coordinate T = T (2), 
energy transfer in the form of heat proceeds along the OX axis 
according to the Fourier law of heat conduction (Fourier heat 
equation): 


where qsec is the specific rate of heat flow, a physical quantity 
numerically equal to the energy transmitted in the form of heat 
in unit time through a plane surface of unit area located per- 
pendicular to the direction of energy transport. The quantity K 
is called the thermal conductivity. It is numerically equal to the 
specific rate of heat flow at the temperature gradient d7/dz 
(the change in temperature over unit length) equal to unity. 
The minus sign in the Fourier law indicates that in heat con- 
duction energy is transported in the direction of decreasing 
temperatures. 

According to the kinetic theory of gases, the thermal conduc- 
tivity is equal to 


K == (u) (à) eve, 


where cy is the specific heat at constant volume (9.5.4). 
The other quantities are defined in Sect. 10.8.4. It follows 
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from the last equation that the thermal conductivity is inde- 
pendent of the gas density. The reasons are similar to those 
given in Sect. 10.8.4 to explain why y is independent of the 
density. 

10.8.6 The laws of transport phenomena, given in the subsec- 
tions 10.8.3, 10.8.4 and 10.8.5, can be written in another form: 


do ze dv 
dT 
dQ=—K -> ds dt. 


Here dM is the mass transported in diffusion during the time 
dt through the element of area dS, located perpendicular to 
the direction of diffusion; dQ is the amount of energy transported 
in the form of heat during the time dt through the same element 
of area dS, located perpendicular to the OX axis; and dF is 
the force of internal friction (viscous force) exerted on an ele- 
ment of surface of the layer having the area dS. The other 
quantities are defined in the subsections mentioned above. 


TABLE 10.1 
Transported | ma t = F la for tl 
Phenomenon ARETE r SAOR ae transport coefficient 
Diffusion Mass aM pat (u) (2) 
way e aga i 
dx 
Internal Momentum jdf = — adl daS | q= = (u) (A) p 
friction dn 3 
(viscosity) 
Heat con- Energy in dQ 1 
duction the form of aT K Ta (u) (à) evp 
heat = — K—— dS dt 


ax 
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10.8.7 Simple relations exist between the transport coefficients: 
K 
=o0D and —=1. 
n=p ney 


If any one of the transport coefficients is known the others can 
be found by these formulas (for known values of p and cy). 
When the coefficients are known, such vital characteristics 
of a gas can be determined as the mean free path (10.5.1) of 
its molecules and the effective diameter of the molecules (8.4.1). 
Information on transport phenomena is listed in Table 10.1. 


10.9 Properties of Rarified Gases 


10.9.4 A gas in said to be rarified (a dilute gas) if its density 
is so low that the mean free path (4) of the molecules is com- 
parable to the linear dimensions d of the vessel confining the 
gas. Such a state of a gas is also called a vacuum. The degrees 
of vacuum of gases are classified as ultrahigh ((A) > d), high 
((A) > d), medium ((A) < d) and low ((A) « d). The proper- 
ties of rarified gases differ from those of nonrarified gases only 
for the first three of these degrees of vacuum. This is evident 
from Table 10.2 which lists certain characteristics for the 
various degrees. 

10.9.2 In the state of a high vacuum (10.9.1) a reduction in the 
density of a rarified gas leads to a corresponding decrease in 
the number of particles without changing (A). This, conse- 
quently, reduces the number of carriers of momentum or inter- 
nal energy in viscosity and heat conduction. The transport 
coefficients in these phenomena are directly proportional to the 
gas density (cf. Sects. 10.8.4 and 10.8.5). Internal friction is 
essentially absent in highly rarified gases. Instead, only exter- 
nal friction exists between the moving gas and the walls of 
the vessel. This is due to the fact that the molecules change 
(heir momentum only in interacting with the walls. Under 
(hese conditions, as a first approximation, the friction stress is 
proportional to the density of the gas and velocity of its motion 
(cf. Sect. 10.8.4). The specific rate of heat flow in highly rarified 
gases is proportional to the temperature difference and to the 
gas density (cf. Sect. 10.8.5). 
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TABLE 10.2 
Vacuum 
Characteristic 
Low | Medium | High | Ultrahigh 
Pressure, 760 to1 | 1 to 10-3 | 40-3 to 1077 |1078 and less 
mmHg 


Number of} 1025 to | 1022 to 1019 | 1029 to 1013 | 1013 and less 
molecules in 4022 
unit volume 


(per m`?) 

Dependence Pressure Directly Heat con- 

of coeffic- dependence |proportional] duction and 
ients K and} None is deter- | to pressure | viscosity are 
y on the pres- mined by the practically 
sure parameter absent 


(d)/d 


10.9.3 A steady state of a rarified gas, contained in two vessels 
connected by a narrow tube, is possible under the condition of 
equality of the counterflow of particles passing between the 
vessels: nı (u1) = na (ug), Where n; and n are the numbers of 
molecules per cm? in the two vessels, and (u,) and (u,) are 
their arithmetic mean velocities (10.3.6). 

If T} and T, are the gas temperatures in the two vessels, the 
preceding condition for a steady state can be rewritten in the 
form of an equation that expresses the Knudsen effect: 


where p, and pz are the pressures of the rarified gas in the two 
vessels. 
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CHAPTER 11 SECOND LAW OF THERMODYNAMICS 


11.1 Cycles. The Carnot Cycle 


41.1.4 A cycle is a succession of thermodynamic processes 
(8.3.7) as a result of which the system returns to its initial state. 
In pV, pT and other thermodynamic diagrams, equilibrium 
cycles (8.3.7) are represented in the form of closed curves, be- 
cause two identical states, 
the beginning and end of 
the cycle, correspond to the 
same point of the diagram. 
A thermodynamic system 
that accomplishes a cycle 
and exchanges energy with 
other bodies is called the 
working medium, or sub- 
stance. This medium is 
usually a gas. 

41.1.2 The arbitrary equi- 
librium cycle C,aC,bC, 
(Fig. 11.1), using an ideal 
gas, can be divided into the 
process of expansion of the 
vas from state C} to state C, 
(curve C,aC,) and the process 
of compressing the gas from state C, to state C, (curve C,bC;). 
In expansion the gas does positive work W,, measured by the 
area under the curve C,aC, (the area of the figure V,C,aC,V2; 
see Sect. 9.4.2). The gas is compressed by external forces that 
do positive work W, = —W,, measured by the area under 
the curve C,bC, (the area of the figure V,C,bC,V.). Since Wy > 
> W, (Fig. 11.1), the gas does the positive work W = W, + 
-+ Wa = W, — W, in the cycle. This work is measured by 
the hatched area, bounded by the curve C,aC,bC, of the pro- 
cess. 

11.1.3 A direct cycle is one in which the system does positive 


y 


Fig. 11.4 


work W= QÙ p dV > 0. Ina pV diagram a direct cycle is repre- 


sented by a closed curve along which the working medium 
passes in the clockwise direction (Fig. 11.1). An example of a di- 


12—0186 
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rect cycle is that accomplished by the working medium in a 
heat engine. In such an engine the working medium (11.1.1) 
receives energy in the form of heat (9.2.1) from external sources 
and gives a part of it up in the form of work (9.2.1) (see Sect. 
11.1.5). 

11.1.4 A reverse cycle is one in which the system does negative 


work W = Ọ® pd4aV < 0. In a pV diagram a reverse cycle is 


represented by a closed curve along which the working medium 
passes in the counterclockwise direction. An example of a re- 
verse cycle is that accomplished by the working medium in a 
refrigerator. In such a device the working medium receives 
energy in the form of work and 
transmits energy in the form of 
heat from a cold body to a hot- 
ter one (see Sect. 11.1.5). 
11.1.5 Since the total change 
in the internal energy of a gas 
in a cycle is equal to zero 
(9.1.3), the first law of thermo- 
dynamics (9.3.1) for such a 
process is of the form 


Q=AU+W= W, 


where Q is the total quantity of 
Fig. 11.2 heat delivered to the gas in the 
a cycle, and W is the work done 
by the gas in this cycle. 
In a direct cycle (11.1.3) Q > 0 and A > 0, and the gas does 
work owing to the heat delivered to it. In a reverse cycle (11.1.4) 
the work done on the gas is W’ = —W (where W > 0) and a 
quantity of heat equivalent to this work is removed from the gas. 
11.1.6 The Carnot cycle is the one shown in Fig. 11.2. The 
direct Carnot cycle consists of four consecutive processes: isother- 
mal expansion 1-1’ at the temperature 7, (where Ty = 7;), 
adiabatic expansion 1’-2, isothermal compression 2-2’ at 
the temperature T, (where 7,’ = T), and adiabatic compres- 
sion 2-4. 
11.1.7 In practice a direct Carnot cycle could be carried out by 
means of a gas enclosed in a vessel with a movable piston. In 
the process 1-1’ the gas is in thermal contact with a heat 
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source (heater) at the constant temperature T¥. From the heater 
the gas receives a certain quantity of heat Q; (where Q, > 0). 
It is assumed that this does not change the temperature of the 
heater. This is possible, strictly speaking, only if the heater 
has an infinite heat capacity. In the process 1’-2 the gas is 
thermally isolated and its expansion is adiabatic** (9.5.10). 
In the process 2-2’ the gas is brought into thermal contact 
with a cooler (heat sink) having the constant temperature 7, 
(where 7, < Tı). Here the gas is isothermically compressed 
and gives off the quantity of heat —Q, to the cooler (assuming 
that Q is the quantity of heat received by the gas from the cool- 
er). The heat capacity of the cooler is assumed to be infinitely 
high. In the state 2’ the gas is again thermally insulated and 
contracts adiabatically to its initial state 1. 

11.1.8 The work done by the gas in an equilibrium (8.3.7) 
direct Carnot cycle is 


W = Q = Qı + Q: = Qi — | Qa I- 


It is evident from the equation that W < Q,, i.e. the work done 
by the working medium (11.1.1) in the Carnot cycle is less 
than the energy received from the heater by the amount of 
energy given up to the cooler in the form of heat. This is valid 
for an arbitrary process: the work W done in the cycle is always 
jess than the sum Qge) of all the quantities of heat delivered to 
the working medium from heaters. 

oe operation of heat engines is based on the direct Carnot 
cycle. 

41.1.9 The thermal efficiency yn is the ratio of the work W done 
by the working medium in a direct cycle to the sum Qge) of all 
quantities of heat delivered in this cycle to the working medium 
by heaters: 


o W 
7 Qadel j 


The quantity ņ characterizes the economy of operation of a heat 
engine. 


* For instance, a large tank filled with water. 
** Tor instance, the cylinder with the gas is covered with 
a thick layer of felt. 


2% 
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11.1.10 The thermal efficiency of a direct equilibrium Carnot 
cycle carried out with an ideal gas is 


= Qı tQ: 2 Ti— Tə 4 fa 
CS" SS a 
1 Ti 1 


n 


The value of yc depends only on the ratio of the temperatures 
of the cooler (T) and the heater (T1). 
11.1.11 The reverse Carnot cycle is shown in Fig. 11.3. In iso- 
thermal compression, carried out in the process 1’-1, the quan- 
tity of heat Q, is removed from 
the gas at the temperature T}, 
which remains constant. In the 
process of isothermal expansion 
2’-2 at the temperature T, < 
< T, the quantity of heat Q, is 
delivered to the gas. In the 
reverse Carnot cycle Qı < 0, 
Qœ 0 and the work W done by 
the gas in one cycle is nega- 
tive: W = (Q, + Q) < 0. This 
conclusion is valid for any re- 
verse cycle. If the working me- 
Fig. 11.3 dium carries out a reverse cycle, 
energy can be transferred in the 
form of heat (9.2.1) from a cold 
body to a hotter one at the expense of the corresponding work 
done by external forces. This principle is used in all refrigerat- 
ing devices. The less the work W’= —W done by external 
forces to extract the quantity of heat Q, from a cold body, the 
higher the economical efficiency (coefficient of performance) of 
a refrigerator: 


W 1— 
Q=W1—Q=W——-=—— AN, 


where y is the thermal efficiency of the direct cycle between the 
same temperatures 7, and 7, (11.1.3 and 11.1.4). 
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11.2 Reversible and Irreversible Processes 


11.2.1 A thermodynamic process is said to be reversible (rever- 
sible process) if the thermodynamic system (8.3.1), in carrying 
out the process first in the direct and then in the exactly re- 
versed order, returns, together with all the external bodies with 
which it interacts, to its initial state. In other words, in a rever- 
sible process the thermodynamic system returns to its initial 
state without leaving any changes in the surrounding medium. 
A necessary and sufficient condition for the reversibility of a 
thermodynamic process is the equilibrium (8.3.3) of all the 
consecutive states in the process. Here all corresponding states 
of the thermodynamic system in the direct and reverse pro- 
cesses are absolutely identical. 

Example 1. The mechanical motion of a body in a vacuum in the 
complete absence of friction is a reversible process. Assume, 
for instance, that a body under these conditions is thrown with a 
certain initial velocity into a gravitational field (6.2.1) at a 
definite angle to the horizon. After describing a parabolic path 
the body drops to the earth at a certain place. Now, if the body 
is thrown from this place at the same angle and the same initial 
velocity, but in the opposite direction, it describes the same 
path in the opposite direction and drops to the earth at the 
initial point. Any intermediate states of the moving body in 
the forward and reverse motions are entirely identical. The 
reversibility of: mechanical motions indicates their symmetry 
with respect to the interchanging of the future by the past, 
i.e. with respect to the reversal of the sign of the time. The re- 
versibility of mechanical motions follows from the differential 
equations of motion (2.4.4). When the sign of the time is 
changed, this changes the sign of the velocity as well, but the 
acceleration, in the motion equations, retains its sign. 
Example 2. Another reversible process is the undamped vibra- 
tions of a body suspended in a vacuum on a perfectly elastic 
spring (28.2.3). The body-spring system is a conservative one 
(3.1.7). Its mechanical vibrations do not change the energy of 
thermal random motion of its particles. Only changes in the 
configuration and velocity of the system lead to a change of its 
state. But these changes are completely repeated after each 
period T (28.1.2) of vibration. Hence, the conditions for the 
reversibility (11.2.1) of the process are complied with. 
11.2.2 Any process that does not comply with the conditions 
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for reversibility is said to be irreversible (irreversible process). 
Example 1. The direct process of braking a moving body by 
means of friction forces is an irreversible process. If these forces 
are the only ones acting on the body, its velocity is reduced 
and it finally stops. The energy of mechanical motion of the 
body as a whole is reduced and is expended in increasing the 
energy of random motion of particles of the body and the sur- 
rounding medium. The internal energy (9.1.2) of the body and 
the medium is increased; they are heated by the action of the 
friction forces. The direct process dealt with here proceeds spon- 
taneously; it is carried out without any processes taking place 
in surrounding bodies. For the reverse process to occur, return- 
ing the system to its initial state, it would be necessary for 
the braked and stopped body to start moving again by cooling 
it and the surrounding medium. As is evident from experiments, 
the thermal random motion of particles of a body cannot spon- 
taneously lead to the initiation of ordered motion of all the 
particles of the body as a whole. To accomplish such motion it 
is necessary to provide for an additional compensating process in 
which the body and surrounding medium are cooled to the 
initial temperature. In this process, the quantity of heat Q is 
given up to the cooler and the work A’ = Q is performed on 
the body. 

Thus, the consecutive performance of such direct and reverse 
processes returns the body-medium system to its initial state, 
but the states of external bodies are changed. Hence, all pro- 
cesses accompanied by friction are irreversible ones. 

Example 2. The direct process of heat exchange (9.2.4) between 
two bodies having different temperatures is a spontaneous one. 
The reverse process, heating one body by cooling another that 
initially has the same temperature as the first body, cannot 
proceed spontaneously. To accomplish this process, a refrigerat- 
ing device (11.1.11) is required. The process of heat exchange 
at a finite temperature difference is an irreversible one. 


11.3 Second Law of Thermodynamics 


11.3.1 The reference in the preceding section to experiments 
that prove the irreversibility of the processes of heat exchange 
and motion with friction (11.2.2) was no chance one. The first 
law of thermodynamics (9.3.1) cannot describe thermodynamic 
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processes comprehensively. An essential limitation of the first 
law is that it cannot be used to predict in which direction a 
thermodynamic process will proceed. Any process that does 
not violate the law of conservation of energy is feasible from the 
point of view of the first law of thermodynamics. Among others, 
the spontaneous transfer of energy in the form of heat from a 
cooler body to a hotter body is also possible. Also feasible is a 
process whose only result would be the obtaining of a certain 
quantity of heat from a body and the conversion of this heat 
into an equivalent amount of work. A periodic-action device, 
based on the first Jaw of thermodynamics, which performs work 
by cooling a single source of heat (for instance, the internal 
energy of large bodies of water), is called a perpetual motion 
machine of the second kind. 

11.3.2 The second law of thermodynamics is the experimentally 
established statement of the impossibility of building a perpe- 
tual motion machine of the second kind (11.3.1). The two most 
common statements of the second law, equivalent to each 
other, are: 

(a) no process is possible whose sole result is the conversion of 
all the heat received from a certain body into an equivalent 
amount of work; 

(b) no process is possible whose sole result is the transfer of 
energy in the form of heat from a cold body to a hotter one. 
11.3.3 It follows from the second law of thermodynamics that 
work and heat are not equivalent forms of transferring energy. 
The conversion of orderly motion of a body as a whole into the 
random motion of its particles is an irreversible process, pro- 
ceeding without compensating processes (11.2.2). The conver- 
sion of random motion of the particles of a body into ordered 
motion of the body as a whole requires the provision of some 
simultaneous compensating process*. 

Example 1. In the isothermal expansion of an ideal gas work 
is done that is fully equivalent to the quantity of heat delivered 
to the gas (9.5.9). The heat received by the gas is completely 
converted into an equivalent amount of work. But the gas 
does not return to its initial state. It expands and its specific 
volume is increased. The conversion of heat into work* is not 
the sole result of the isothermal expansion of an ideal gas. 


* Such a process is sometimes, not quite correctly, called the 
“conversion of heat into work”. 
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Example 2. In a heat engine based on the direct Carnot cycle 
(11.1.8), work is done owing to the heat delivered from the 
heater. But a part of the quantity of heat obtained is given up to 
the cooler (41.1.7). Hence, the work done in a cycle is not equi- 
valent to the whole quantity of heat delivered to the working 
medium. 

Example 3. In refrigerating devices operating on the reverse 
Carnot cycle (11.1.11), a certain quantity of heat is transferred 
from a cold body to a hotter one. But external forces do work to 
accomplish this and, consequently, a compensating process is 
being carried out. 

11.3.4 The Carnot theorem states that the thermal efficiency 
(11.1.9) of the reversible Carnot cycle does not depend upon 
the composition of the working medium and is always ex- 
pressed by the formula (11.1.10): 


Qto: T1—T 
ee OG: Tr 
The thermal efficiency nc ir of the irreversible Carnot cycle is 
always less than the thermal efficiency yc rey of the reversible 
Carnot cycle, carried out at the same temperatures 7, and T,: 
Ncir < Ncrev- | 
The thermal efficiency of any reversible cycle cannot be higher 
than that nc of a reversible Carnot cycle being accomplished 
by means of a heater and cooler having the temperatures 7, 
and Ta: 
T,—T 
rev S 1a ° 
1 


11.3.5 The Carnot theorem (11.3.4) is used to establish the 
thermodynamic temperature scale.’ It follows from the first equa- 


tion in the preceding subsection that 7,/T,= —Q,/Q, or, since 
Q» < 0, 

Ta _*1Qel 

Tı Qi - 


In order to compare the temperatures 7, and T, of two bodies 
it is necessary to carry out a reversible Carnot cycle in which 
these bodies are the heater and cooler. Then, according to the 
ratio of the numerical values of the quantities of heat delivered 


11.4] ENTROPY AND FREE ENERGY = 185 


to and received from the bodies, the ratio of their temperatures 
is determined. The result of such a comparison of temperatures 
does not depend upon the chemical composition of the working 
medium (11.3.4). Therefore, the thermodynamic temperature 
scale is independent of the properties of the temperature-indi- 
cating substance (8.3.4) and, in this respect, has great generali- 
ty. However, since all real thermodynamic processes are irre- 
versible, the comparison of temperatures by the above-men- 
dene procedure is unfeasible; it has only a theoretical signi- 
icance. 


11.4 Entropy and Free Energy 


11.4.1 The ratio of the quantity of heat Q, received by a body 
in an isothermal process, to the temperature 7 of the body 
giving up the heat is called the reduced quantity of heat Q*, i.e. 
Q* = Q/T. In heating a body (Q > 0), Q* is positive; in cool- 
ing (Q < 0), Q* is negative. 

The reduced quantity of heat delivered to a body in an infi- 
nitely small portion of an arbitrary process, is equal to 6Q/T, 
where T is the temperature of the corresponding heater*. The 
reduced quantity of heat Q#_. for the arbitrary portion 1 — 2 of 
the process C,C, is 


Co 


11.4.2 The reduced quantity of heat Q*,, delivered to a body 
in any reversible cycle equals zero: 


ô 
rey = L =0. 


rev 


Here T is the temperature at which an element of the quantity 
of heat ôQ is delivered to the body. It follows from the equation 
in the preceding subsection that the function 5Q/T is a com- 


* In the case of a reversible process, T coincides with the tem- 
perature of the body that is carrying out the process. 
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plete differential of a certain function S [in contrast to 6Q, 
which is not a complete differential (9.4.3)]: 


si (T)ar 


The single-valued state function (9.1.3) S, whose complete diffe- 
rential is determined by the preceding equation, is called the 
entropy of a body. It is evident from this equation that dS and 
ôQ have the same sign. Consequently, from the way in which 
the entropy varies it is possible to determine in which direction 
heat exchange (9.2.4) takes place. In heating a body (6Q > 0) 
its entropy increases (dS > 0). If a body is cooled (ôQ < 0), 
its entropy decreases (dS < 0). 

Example. The complete differential of the entropy of an ideal 
gas is expressed by the equation 


dQ M aT M dV 

as=(-7-) Cwi p ea y 

where M is the mass of the gas, u is its molar mass (8.4.3), 
Cy, is its molar heat capacity at constant volume (9.5.4), R 
is the universal gas constant (8.4.4), T is its temperature, 
and V is its volume. This result is obtained on the basis of the 
first law of thermodynamics (9.3.1) for 6Q, with the Mendeleev- 
Clapeyron equation (8.4.4) taken into account. 

The change AS; in the entropy of an ideal gas going over 
from state 1 to state 2 does not depend upon the kind of pro- 
cess 1 — 2 used in this transition: 


rev u 


AS, .2=5,—S,=—— (Cv In FAHR In 7) : 


11.4.3 Certain of the most important properties of the entropy 
of closed systems (2.2.4) are: 

(a) The entropy of a closed system in a reversible Carnot cycle 
(11.1.6) does not change: 


MSc = 0 and S = const. 


(b) The entropy of a closed system increases in an irreversible 
Carnot cycle: 


ASir > 0. 
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(c) In any processes that occur in a closed system its entropy 
never decreases: 


AS > 0. 


In an element of change in the state of a closed system, the 
entropy does not decrease: dS > 0. 

The equality sign pertains to reversible processes and the greater- 
than sign to irreversible ones. Item (c) is one of the statements 
of the second law of thermodynamics. 

11.4.4 The following relation is valid for an arbitrary process 
occurring in a thermodynamic system (8.3.1): 


60 < T ds, 

where T is the temperature of the body that transfers the energy 
5Q to the thermodynamic system in the process of an infinitely 
small change in the state of the system. Making use of an equa- 


tion of the first law of thermodynamics (9.3.1) for 6Q, the 
preceding inequality can be rewritten in the form 


T dS > dU + ôW, 
combining the first and second laws. 


11.4.5 For a reversible process 

Sw = —(dU — T dS) 

or l 

ôW = —d (U — TS) — S dT = —dF — S dT, 
where F = U — TS 


is called the free energy. The free, or available, energy is the 
difference of two state functions (8.3.8) and is therefore also a 
state function of the thermodynamic system. 
If the system carries out a reversible isothermal process, dT = 
= 0 and ÔWisot = —dF. When the system passes from state 1 
to state 2 in a reversible isothermal process, 


Wisot = Fi — Fo. 


The decrease in free energy is the measure of the work done by a 
system (or body) in a reversible isothermal process. 
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11.4.6 It follows from the equation U = F 4+ TS that the 
internal energy of a body (or system) is equal to the sum of the 
free energy F and the bound energy TS. The bound, or unavail- 
able, energy is the part of the internal energy of a body (or 
system) that cannot be transferred in the form of work in an 
isothermal process. In this sense, this part of the internal energy 
is “depreciated”. The more the entropy of the body (or system), 
the more the bound energy. Therefore, the entropy of a body 
(or system) serves as a measure of the “depreciation” of its 
energy. 


11.5 Statistical Interpretation of the Second Law 
of Thermodynamics 


11.5.1 The statement of the second law of thermodynamics on 
the impossibility of a decrease in the entropy in an isolated 
system (11.4.3) can be interpreted statistically, on the basis 
of the molecular-kinetic theory of the structure of matter, by 
means of Boltzmann’s relation: 


S = k ln P + const, 


where S is the entropy of the system, k is Boltzmann’s constant 
(8.4.5), and P is the thermodynamic probability of the state. 
11.5.2 The thermodynamic probability P of a state of a body (or 
system) is equal to the number of all possible kinds of coordi- 
nate and velocity distributions of its particles corresponding to 
the given thermodynamic state (8.3.3). By definition, P is a 
whole number not less than unity (P > 1). The Boltzmann rela- 
tion (11.5.1) leads to the following statistical interpretation of 
the second law of thermodynamics: the thermodynamic prob- 
ability of a state of a closed system cannot decrease, regardless 
of the processes that occur in it. In any process that proceeds 
in a closed system and brings it from state 1 to state 2, the 
change AP in the thermodynamic probability is either posi- 
tive or zero: AP = P, — P > 0. 

In a reversible process AP = 0, i.e. the thermodynamic prob- 
ability P is constant. If the process is irreversible, AP > 0 
and P increases. This means that an irreversible process trans- 
fers the system from a less probable state to a more probable one 
and, in the limit, to an equilibrium state (8.3.3). | 


~ 
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11.5.3 Being a statistical law, the second law of thermodyna- 
mics describes the laws of random motion of a great number of 
particles making up a closed system. In systems that consist of 
a relatively small number of particles, fluctuations (11.6.1) are 
observed. They are deviations from the second law of thermo- 
dynamics. 

41.5.4 The second law of thermodynamics, established for 
closed systems on the earth, cannot be applied to the whole infi- 
nite universe. Such an application leads to the conclusion that 
the temperatures of all bodies in the universe equalize. This is 
incorrect from both the philosophical and physical points of 
view. According to this erroneous conclusion, all forms of 
motion, except random thermal motion, will cease and the 
universe will then reach a state of so-called “heat death”. Actu- 
ally, owing to the infinity of the universe, fluctuations n 
are inevitable in certain of its parts. These violate therma 
equilibrium. The duration and magnitude of the fluctuations 
can be extremely great. It has been proven that there can be 
no equilibrium state, corresponding to “heat death”, for the 
infinite universe. 


11.6 Fluctuations 


11.6.1 Substantial deviations of certain physical quantities, 
characterizing a system, from their average values may occur 
in systems consisting of a comparatively small number of par- 
ticles. Such deviations are called fluctuations of physical quan- 
tities. For instance, in highly rarified gases the density at var- 
ious places in the volume of the gas may differ from the aver- 
age density corresponding to the equilibrium state for definite 
p and T values. Chance deviations of the temperature T, pres- 
sure p and other physical quantities may occur in exactly the 
same way. 

11.6.2 If M is the true value of a physical quantity, and (M) 
is its average value, then the quantity AM, where AM = 
—= M — (M), and its average value (AM) = (M — (M)) 
cannot he measures of the fluctuation of the quantity M. The 
quantity AM is not constant with time, and the quantity 


(AM) = (M) — (M) = 0°. 


* Used here is the statement that the mean value (M) of a con- 
stant quantity coincides with the quantity. 
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This last equation follows from the fact that the deviation of 
the quantity M from (MY occurs in both directions, towards 
values larger than the average and towards values smaller than 
the average, with equal frequency. 

11.6.3 A measure of the fluctuations of the physical quantity 
M is the mean square of the difference AM. It is called the 
mean square fluctuation: 


((AM)?) = (M — AM)*) = (M?) — ((M))2* 


The mean square fluctuation is essentially positive or zero: 
((AM)?) > 0. 

The absolute fiuctucticn is the quantity V ((AM)2), which also 
characterizes the deviation of M from (M). A small absolute 
fluctuation signifies that large deviations of M from (M) occur 


very rarely. 
The relative fluctuation Ôm is the ratio of the absolute fluctuation 


to the average value (M) of a physical quantity: 
5 = —V MAM) 
(M) l 
The relative fluctuations in the concentration of particles (or 
density) of a gas, and in its pressure and temperature, decrease 


with an increase in the number N of gas molecules contained in 
the vessel: 


_ _V ((p)?) 1 5. — V (Ap)*) 1 
a 


ôo = ~~) VN’ (SVR 
VUO „i 
en CYN 


When N = Nag (Avogadro’s number, see Appendix II), p, 5p 
and ôr have values of the order of 10-14. 


* This last equation, following from the rules of algebraic oper- 
ations with average values, underlines the fact that the average 
value (M?) of the square of a quantity should not be confused 
with the square of the average value ({M))?. 
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If a system consists of N independent particles, the relative 
fluctuation of an additive state function (9.1.3) of the system 
is inversely proportional to the square root of N: 


1 

6 —— | 

mM OX VN 

11.6.4 The following are examples of fluctuations of physical 
quantities. 
Example 1. In measuring the temperature by means of a gas 
thermometer filled with an ideal gas (8.4.1), the thermometer 
readings are not constant owing to fluctuations of temperature. 
The variations in temperature measured by the thermometer 
cannot be less than the absolute fluctuation of the thermometer 
indication, equal to y ((AT)?), i.e. At > V ((AT)?). 

E absolute fluctuation can be determined by the formula in 
11.6.3: 


TANGO ee 
AT)? —, 
V AT} & Vi 


Thus 


At > VD) X a 


If the gas thermometer contains 10-8 mole of gas, i.e. N = 
= 6.02 x 1015, then the minimum changes in temperature Aż. 
that can be detected by the instrument, are of the order of mag- 
nitude Aż % 10-1 (T. This value indicates the limit of sen- 
sitivity of the gas thermometer. 

Real variations in temperature, commonly encountered in 
experiments, are incommensurably larger than At. 

Example 2. Electrical fluctuations in circuits restrict the lim- 
its of sensitivity of radio receiving equipment. Among others, 
fluctuations in the number of electrons emitted by the incandes- 
cent cathode lead to fluctuations in the current supplied to the 
vacuum tube. This is called shot, or Schottky, noise. A measure 
of the shot noise is the mean square fluctuation of the current: 


(an) 222, 


192 SECOND LAW OF THERMODYNAMICS jii 


where e is the charge of the electron, and J, is the average cur- 
rent over the time ¢ during which it is measured. Here t > T, 
where Tt is the time of flight of the electron in the tube. 


11.7 Brownian Movement 


11.7.4 The Brownian movement is the continuous haphazard zig- 
zag motion of small particles suspended in a gas or liquid and 
can be observed in a microscope. The Brownian movement is 
caused by fluctuations (11.6.1) in the pressure exerted by the 
molecules of gas or liquid on the surfaces of the minute suspend- 
ed particles. As a result of pressure fluctuations the Brownian 
particles are subject on all sides to the action of unbalanced 
forces which cause their complex motions that are observed. 
11.7.2 Under constant external conditions, no changes are 
observed in the motions of the Brownian particles and they 
continue indefinitely. This is testimony to the continuous na- 
ture of random thermal motion of the molecules to which the 
displacements of the Brownian particles are due. The velocity 
v and energy E of motion of these particles depend upon their 
size, but not on their chemical composition. The values of v 
and E increase with the temperature and with a reduction in 
the viscosity of the gas or liquid. 

11.7.3 It has been experimentally established that Brownian 
particles can move upwards, rising in the fluid. This occurs 
when the molecules of the gas (or liquid) transmit an uncom- 
pensated momentum directed upwards to the particle. This 
increases the potential energy of the particle at the expense of 
the kinetic energy of the molecules surrounding it, thereby 
leading to localized cooling of the gas or liquid. This means that 
the mechanical energy of the Brownian particle has increased 
by cooling a single source of heat, the liquid or gas, thereby 
contradicting the second law of thermodynamics (11.3.2). In 
this manner, Brownian movement is proof of the limited appli- 
cation of the second law of thermodynamics and of its statisti- 
cal nature (11.5.1). 

11.7.4 The motions of Brownian particles are completely chao- 
tic. Therefore, the average displacement (x) of a particle along 
an arbitrary direction equals zero. Then mean square displa- 
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cement (z? is proportional to the time ż of observation of the 
particle and is expressed by Finstein’s equation: 


(17) = 2D. 


where D is the diffusion coefficient of the Brownian particles. 
For spherical particles of radius r 


RT 
Aa 6nnrN a ° 


Iere T is the absolute temperature, R is the universal gas con- 
stant (8.4.4), ņ is the coefficient of viscosity of the liquid or gas 
(10.8.4), and Na is-Avogadro’s number (Appendix II). 


14.8 Third Law of Thermodynamics 


11.8.1 As a single-valucd state function of a system entropy is 
introduced by means of a differential relation (11.4.2). Hence, 
the entropy can be determined up to an arbitrary constant, 
which cannot be found from the first and second laws of thermo- 
dynamics. It is consequently impossible to determine the 
absolute value of the entropy. 

11.8.2 An experimental investigation of the properties of 
substances at ultralow temperatures led to the establishment of 
the third law of thermodynamics. This law, also called the Nernst 
heat theorem, states that the change in entropy equals zero in 
any isothermal process conducted at absolute zero: 


AS pg == 0 and S = Sy == const 


regardless of the changes in any other state variables (for example, 
volume, pressure, and intensity of the external force field). 
The third law does not enable the absolute value of the entropy 
lo be determined. But the fact that the entropy is constant as 
T —> 0 permits this constant value to be employed as the ref- 
erence point for entropy values. This enables the change in en- 
tropy to be determined in processes being investigated. 

i{.8.3 Planck extended Nernst’s heat theorem by stating that 
the entropy of a system equals zero at absolute zero tempera- 
lure. If Ey, Ei, ..., En is a sequence of energy levels (38.2.5) 
ol a system, then at absolute zero temperature the system in 
equilibrium is in a quite definite lowest state with energy exact- 
ly equal to E. Here the thermodynamic probability P of the 
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state (41.5.2) is equal to unity (P = 1) and, according to Boltz- 
mann’s relation (11.5.1), 

So = k ln P = 0. 

11.8.4 The heat capacity Cy, at constant volume (9.5.4) for 
all bodies vanishes at T = 0 K. As a matter of fact, if the 
temperature of a system is sufficiently low, so that the average 
kinetic energy kT of a particle (10.6.4) is substantially less 
than the difference AE between the lowest and first energy lev- 
els (AE >> kT), then the thermal excitation of the system is 
insufficient to transfer it from the state with the energy E, to 
one with the energy E. Therefore, at ultralow temperatures the 
system should be in a state having the least energy Æ. The 
internal energy U, of the system (9.1.2) is equal to Eg, i.e. 
U, = E,. Hence the heat capacity of the system at constant 


volume 
_ {aU __ ( dE o 
Cyu = (sr) =( ar ) =° as T —Q. 
The coefficient of cubic expansion (40.2.3) also vanishes at 


absolute zero temperature. 

It follows from the third law that a process in which a body 
could be cooled to absolute zero temperature is impossible 
(principle of the unatiainability of absolute zero temperature, 
sometimes called the Nernst principle). 


CHAPTER 12 REAL GASES AND VAPOURS 


12.1 Forces of intermolecular Interaction 


12.1.1 The properties of gases that are not highly rarified differ 
from those of ideal gases, which comply with the Mendeleev- 
Clapeyron equation (8.4.4). Experiments indicate that the 
specific heat capacities (9.5.2), coefficients of viscosity (10.8.4) 
and other quantities have different values for real gases than 


for ideal gases. 
12.1.2 A real gas is one in which forces of molecular interaction 


exist between the molecules. 
A vapour is a real gas in states close to that in which it con- 


denses. 
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12.1.3 The forces of intermolecular interaction decrease rapidly 
with an increase in the distances between the molecules (short- 
range forces). At intermolecu- 
lar distances exceeding 10-9 m, 
these forces can be neglected. 
Forces of interaction between 
molecules can be subdivided 
into forces of attraction and 
jorces of repulsion. Both types act 
simultaneously. Otherwise it 
would be impossible to deter- 
mine the volume of liquids or 
solids: their constituent par- 
ticles would either fly apart in 
all directions or would stick 
together, forming the minimum 
possible volume. 

12.1.4 The forces of mutual 
attraction and repulsion depend 
differently on the distance r 
between the molecules. At dis- 
tances commensurable with 
the linear dimensions of atoms Fig. 12.1 

and small inorganic molecules 

(10-1° m), repulsive forces F, pre- 

dominate; at distances r of the order of 10-9 m forces of mutual 
attraclion F, prevail. If r is the radius vector to the point 
where molecule A is located from a point where molecule B is 
located and acts on molecule A with the forces F} and F,, then 


r r 
Fi=Fir py and F= For —. 


Projections Fır and Fər of forces F, and F, on the direction of r 
depend upon r in the following way 


Fir = b/ri8 and For = —alr’, 


where a and b are factors that depend on the structure of the 
molecules and the type of intermolecular interaction. The curves 
in Fig. 12.14 show the dependence of Fir and For on r. Forces 
of repulsion are conventionally accepted as positive and at- 
tractive forces as negative (Fig. 12.1). 


13% 
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The resultant force is 
r 
F=F,+F,=F,— , 


and F, = Fy, + Fər. The dependence of F, on r is also shown 
in Fig. 12.1. 

12.1.5 At r= rọ, forces F, and F, counterbalance each other 
and F = 0. At r > r force F, > F}; at r < ro force F, < Fy. 
Thus ry is the equilibrium distance between the molecules and 
where they would be with respect to each other if there were 
no thermal motion. 

12.1.6 The element of work 6W done by the resultant force F, 
when the distance between the molecules is increased by the 
amount dr, is equal to the decrease in the mutual potential 
energy Ep of the two molecules (3.3.1): 


ôW = (F, dr)= F, dr = —dEp. 
Integrating with respect to r from r to oo: 
E po 


(čan = — | F dr and Ep—Epo= j F, dr. 
Ey’ T r 


At r = oo the molecules do not interact and Epo = 0. Hence, 
oo 

Ey = | F, dr. 
; 


This integral can be calculated graphically from the known de- 
pendence of F, on r (Fig. 12.1). It is proportional to the area 
bounded by the curve Fp = Fr(r), the r axis and the value of r 
(where r = const) for which Ep is being calculated. At r > ro, 
the energy Ep < 0 because Fp < 0; at r = ro, the energy Ep 
reaches its minimum value: Lp = Emin. This follows from 
the equation 


dEp , í 
( ar ) ap, = Pr (ro) = . 
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A system consisting of the interacting molecules, in a state of 
stable equilibrium (r= ro), possesses minimum potential energy. 
At r<ro, the energy Ep begins to increase, becomes positive 
and then rises drastically due to the rapid increase in the 
repulsive forces as r drops (Fig. 12.2). 

12.1.7 The value E = Ep min, the minimum potential energy 
of interaction of molecules, is the criterion for various states 
of aggregation of matter. 

If | E |p min < kT, the substance is in a gaseous state. When 
| E lp min >> kT, it is in the solid state. The condition 


P 


LAE 


Fig. 12.2 


| E lpmin % kT corresponds to the liquid state. Here kT is 
twice the average energy per degree of freedom of thermal mo- 
tion of the molecules (10.6.4). 

12.1.8 Van der Waals forces are weak* forces of attraction that 
act between molecules at distances of the order of 10-® m (12.1.4). 
They are the reason for the correction of the internal pressure 
in the Van der Waals equation of state for a real gas (12.2.4). 
Distinction is made between three types of Van der Waals forces 
of attraction. 

(a) Orientational forces of attraction Fo, are due to the fact 
that asymmetric polar molecules (18.1.4) have constant dipole 


* Weak in the sense of their small magnitude compared to the 
forces of attraction that provide for the formation of stable 
molecules (12.1.9). 
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moments pe (18.1.4). The result of interaction of these dipole 
moments is the attraction force For. This force is opposed by 
the thermal motion of the molecules. Calculations indicate 
that 

Pe 1 
Er aa 


where r is the distance between the molecules, k is Boltzmann’s 
constant (8.4.5), and 7 is the absolute temperature. 

(b) Induction forces of attraction Fin are exerted when a neutral 
molecule of a real gas is in an electric field set up by another 
molecule, both molecules having high polarizability (48.1.3). 
If the molecules are sufficiently close together, the action of the 
electric field of the second molecule develops an induced dipole 
moment Pe = &,PpE (18.1.4) in the first molecule, where p is 
the polarizability of the molecule (18.1.3), and E is the strength 
of the electric field set up by the second molecule. To constant 
factors 


1 
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(c) The vibration of electrons in one molecule (or atom) can 
excite vibration in another molecule (or atom). The vibrations 
of the electrons in the two molecules are in phase with each 
other, leading to attraction between the molecules (or atoms). 
The resulting resonance forces of attraction are called disper- 
sion forces. They play the main role in the interaction of non- 
polar molecules (18.1.3). The force of dispersion attraction is 


ethyvo 1 
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where e is the charge of the electron, k is Planck’s constant, 


yo = (1/27) V k/m is the frequency of vibration of the atoms 
(28.1.2), k is the coefficient of the quasi-elastic force (40.3.7), 
and m is the mass of the atom. 

Dispersion forces owe their name to the analogy between the 
origin of these forces and dispersion phenomena occurring 
when electromagnetic waves propagate through matter (34.4.1). 
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12.1.9 The attraction force F, between two molecules, shown 
in Fig. 12.4, is the resultant of all the kinds of attraction enu- 
merated in Sect. 12.1.8. The potential energy of Van der Waals 
attraction amounts to (0.4 to 4) X 10% J/mol. 

In addition to the electromagnetic interaction, a special kind 
of quantum interaction exists between molecules at distances 
r < 10-10? m. This interaction leads either to the development 
of repulsive forces between the molecules (12.1.4), or to strong 
attraction of adjacent atoms (or groups of atoms) and the for- 
mation between them of ionic and covalent chemical bonds 
(39.4.4 and 39.4.5). The result of such bonds is the formation 
of stable molecules. The potential energy of chemical bonds 
exceeds the energy of. Van der Waals attraction and ranges from 
(0.4 to 4) X 10% J/mol. 

12.1.10 Being internal forces (2.2.4), the forces of interaction 
in a system of two molecules cannot change the total energy E 
of the system, which is the sum of the kinetic energy Ex of the 
molecules and their mutual potential energy Ep. Hence 


dE = dE, + dEp = 0 or dEy = —dEp = Fr dr. 


Use is made here of the equation in Sect. 12.1.6. 

When the molecules are brought closer together (dr < 0), to the 
distance ry (Fig. 12.1), the potential energy Ep decreases and 
the kinetic energy Ey accordingly increases. This occurs at the 
expense of the positive work done by the resultant force of mu- 
tual attraction between the molecules (12.1.4). (Fr < 0 when 
r > r). As the molecules are brought still closer, they do work 
in overcoming the resultant force of mutual repulsion (Fr > 0 
when r < rọ). This reduces the kinetic energy of the molecules. 
At the instant they are closest together (r = rı in Fig. 12.2) 
all of the kinetic energy of the molecules has been completely 
expended in doing work to overcome the repulsive forces: 
ky = 0 and the total energy E is equal to the potential energy 
Ep, ie. E = Ep (Fig. 12.2). 

If all the state variables of real gases are maintained constant 
with the exception of the temperature, the distance r, decreases 
when the gas is heated. This decrease is very small, however, 
even at high temperatures. This is due to the extremely steep 
increase in the repulsive forces F} as r is decreased (Fig. 12.1). 
Thus the distance r, is the effective diameter d of the molecule 
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(8.4.1). The finite dimensions of molecules of real gases are 
explained by the action of repulsive forces between the mol- 
ecules. 


12.2 Van der Waals Equation of State 


12.2.4 A Van der Waals gas is a model of a real gas in which 
the molecules are dealt with as perfectly rigid spheres of diam- 
eter d (12.1.10), with forces of mutual attraction acting between 
them. The finite size of the spheres signifies that the repulsive 
forces between the molecules of a real gas are also taken into 
account. 

12.2.2 The molecules of a real gas, each having a volume of 


v = md3/6, do not move around as freely in a vessel as the 
“point” molecules of an ideal gas. Therefore, in the Mendeleev- 
Clapeyron equation (8.4.4) pV, = RT, the total volume V, ofa 
vessel holding one mole of the gas should be replaced by the 
“free” volume 


where b is Van der Waals’s correction for the intrinsic volume 
of the molecules. Correction b is equal to four times the volume 
of all the molecules contained in one mole of the gas: 


b = 4N AD, 


where NA is Avogadro’s number (Appendix II), and v is the 
volume of a single molecule. 

12.2.3 The forces of mutual attraction between the molecules 
are taken into account for a Van der Waals gas by introducing 
a correction for the pressure in the Mendeleev-Clapeyron equa- 
tion (8.4.4). Owing to the short-range nature of the attraction 
forces (12.1.3), each molecule interacts only with particles that 
are located at distances r< Rm, where Rm is the range of mol- 
ecular aclion, with a magnitude of the order of 10-° m. A sphere 
of radius Ry is called the sphere of molecular action. 

The forces of attraction of a molecule within the bulk of the 
gas to other molecules are mutually counterbalanced and have 
no effect on the motion of the given molecule. If the molecule 
is within a layer of gas that borders on a wall of the vessel, it is 
subject to an uncompensated force of attraction directed in- 


12.2] VAN DER WAALS EQUATION 201 


ward, into the bulk of the gas. Consequently, in its collision 
with a wall, such a molecule imparts less momentum to the 
wall (10.2.1), and the pressure p exerted by a real gas on the 
walls is reduced in comparison to the pressure pig of an ideal 
gas having the same density at the same temperature, 


p = Pid — p*, 
or 
Pid = p + p*, 


where p* is the Van der Waals correction due to the forces of 
mutual attraction and is called the internal pressure. The inter- 
nal pressure p* is inversely proportional to the square of the 
volume V,, of the vessel occupied by one mole of the gas: 


a 
pee a 
Vi 


where a is the Van der Waals coefficient and depends upon the 
chemical nature of the gas. 

12.2.4 The Van der Waals equation describes the state of a real 
gas and differs from the Mendeleev-Clapeyron equation (8.4.4) 
in that the corrections b and p* have been introduced (12.2.2 
and 12.2.3). For a single mole of gas it is of the form 


(e+) (Vy—b) = RT. 


12.2.5 The Van der Waals equation for an arbitrary mass of 
real gas, having the molar mass u (8.4.3), is 


M? a, M \_ M, 
r wr) Ua n) aa RT. 
This equation is valid for gases that are not too highly com- 
pressed. In highly rarefied gases V,,> b, p* < p and the Van der 
Waals equation does not differ from the Mendelcev-Clapeyron 


equation, 
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42.3 Isothermals of Real Gases. Phase Transitions 


12.3.1 The curve representing the dependence of the molar 
volume of a gas on the pressure at constant temperature is called 
the isothermal, or isotherm, of a real gas. Isothermals of carbon 
dioxide gas are shown in Fig. 12.3. At temperatures T below 
Te = 340 K, all the isothermals have horizontal steps along 
which, in addition to the constant temperature, the pressure is 
also constant while the molar volume varies. The difference 
Vp — Vp of the molar volumes over the horizontal steps de- 
creases as the temperature is raised (Fig. 12.3). At T = Te this 
difference vanishes. The temperature T = Te, corresponding to 
the condition Vp — Vp = Q, is called the critical temperature. 
The isothermal of a real gas at T = Te is called the critical 
isothermal. On this isothermal points D and B merge into 
point C, which is called the critical point. The state variables 
(8.3.2) of the gas at the critical point are the critical variables 
Vuc» Pe and To. 

Critical point C is an inflection point on the critical isothermal. 
Consequently, the tangent to the isothermal at this point is 
parallel to the OV, axis. 

12.3.2 Any subcritical isothermal (T < Te) is a curve repre- 
senting the continuous transition of the substance from the 
gaseous to the liquid state. It consists of three portions: TD, 
DB and BA, each of which represents different states of the 
substance. Over the portion TD the substance is in the gaseous 
state, portion DB corresponds to the transition of the substance 
from the gaseous to the liquid state. Over the portion BA of 
the isothermal, the substance is a liquid. This portion is almost 
vertical owing to the low compressibility of liquids. Points D 
and B of the horizontal portion of the isothermal correspond 
to the beginning and end of condensation in isothermal compres- 
sion of a real gas. On the contrary, in isothermal expansion of 
the liquid, points B and D are the beginning and end of the 
boiling process. Point B corresponds to'the state of a boiling liq- 
uid; point D to dry saturated vapour. The mixture of the boiling 
liquid and dry saturated vapour that exists at any point M of 
portion BD is called wet vapour (Fig. 12.3). 

12.3.3 In thermodynamics a phase is a totality of all the parts 
of a system, having the same chemical composition and being 
in the same state. Wet vapour, for instance, is a two-phase sys- 
tem consisting of boiling liquid and dry saturated vapour. 
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Shown in Fig. 12.4 are two boundary curves BC and DC which 
connect points B and D (Fig. 12.3) at various temperatures. 
Curves BC and DC converge at critical point C. Boiling curve 
BRC separates the single-phase liquid region J from the two- 
phase wet vapour region JI. The curve BC is one representing 
(he beginning of the first-kind phase transition of the substance 
from the liquid to the gaseous state. The boundary curve DC 
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Fig. 12.3 Fig. 12.4 


separates the two-phase region JI from the single-phase region 
[II in which the substance is in the gaseous state. 

12.3.4 The two-phase region JI cannot exist at pressures above 
the critical one pe (12.3.1) at which the substance can be in 
cither of two ‘states, gaseous or liquid. No pressure can convert 
gas at a temperature above the critical Te into the liquid state 
by isothermal compression. The critical temperatures of many 
vases are very low: Te ~ 5 K for helium and 33 K for hydro- 
von. This makes the liquefaction of such gases quite difficult. 
{2.3.5 Besides the difference in the molar volumes of boiling 
liquid and dry saturated vapour, the specific heat of vaporiza- 
tion (13.6.4) also vanishes when a substance is in the critical 
state, as does the surface tension (13.4.4). The difference be- 
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tween the liquid and gaseous states of a substance completely 
disappears when it is in the critical state. 

{2.3.6 The Van der Waals equation (12.2.4) is a cubic equation 
with respect to the molar volume V,, having factors that de- 
pend upon the pressure, temperature and the chemical nature 


Vig. 12.5 


of the gas. This equation has either one or three real roots, in 
accordance with the numerical values of p and T. Shown in 
Fig. 12.5 are the isothermals of a real gas that complies with 
the Van der Waals equation. The isothermals correspond to the 
different temperatures 7, < Ta < T < T} < Te < T; < Te. 
All the subcritical isothermals have a hatched portion where 
cach pressure corresponds to three different states, which are 
represented by the three points, B, E and G, of the isothermal. 
The wave-shaped portion BDEFG of the isothermal in Fig. 12.5 
more accurately represents the transition of the substance from 
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the gaseous to the liquid state than the horizontal portions of 
experimentally plotted isothermals (dash straight lines BG). 
The portion BD of the isothermal corresponds to a superheated 
liquid, which can be obtained if the beginning of boiling is de- 
layed at point B. The portion GF of the isothermal represents 
the state of supersaturated vapour, occurring upon slow isother- 
mal compression in the absence of condensation centres. If such 
centres (dust specks or ions) are introduced into the supersatu-. 
rated vapour, extremely rapid condensation of the vapour takes 
place. Over the portion DEF, the molar volume increases (de- 
creases) as the pressure is increased (decreased). Such states of 
matter are impossible. The horizontal steps BG cut the portions 
BDEFG of the isothermals in such manner that the hatched areas 
BDEB and EFGE in Fig. 12.5 are equal to each other (Maz- 
well’s equal area rule). 

12.3.7 The values of the critical variables pe, Vic and Te (12.3.1) 
can be expressed in terms of the coefficients a and b in the Van 
ae yo equation (12.2.4) and the universal gas constant R 
(8.4.4): 


1 a 8 a 
Pe ~O7 pe? Vuc = 3b and To= 57 OR ° 


12.4 Superfluidity of Helium 


12.4.1 Helium is a kind of substance which, after going over 
from the gaseous to the liquid state at standard atmospheric 
pressure and a temperature of 4.22 K, remains in the liquid state 
no matter how close to absolute zero (11.8.4) it is cooled. 
12.4.2 At the temperature 2.19 K a second-order phase transi- 
tion occurs: liquid helium I, existing at temperatures T > 
> 2.19 K, undergoes the transition to liquid helium II, 
which exists at T < 2.19 K. 

A second-order phase transition is a transformation of matter 
that is not associated with the evolution or absorption of heat 
as in a first-order transition. In second-order phase transitions 
the heat capacity, coefficient of thermal expansion and certain 
other characteristics of substances vary with discontinuous 
jumps. Some examples of second-order phase transitions are 
the transition of iron at the Curie point (26.5.2) from a ferro- 
magnetic substance into a paramagnetic one (26.3.5) and the 
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transition of cerlain metals and alloys al extremely low tem- 
peratures to the superconductive state (41.6.1). 

12.4.3 Superfluidity, a phenomenon observed in liquid he- 
lium IJ, is the practically complete absence of viscosity when 
such helium flows through narrow capillary tubes (13.5.5). 
The coefficient of viscosity (10.8.4) of helium II is less than 
10-12 Pa-s, while that of helium I close to the temperature 
4.22 K is of the order of 10-6 Pa-s. 

12.4.4 According to the two-fluid theory, liquid helium II isa 
mixture of a superfluid component and a normal component. 
The superfluid component moves without friction and does not 
participate in the transfer of energy in the form of heat (9.2.1). 
The normal component moves with friction and does participate 
in energy transfer. 

12.4.5 The theory of superfluidity is based on quantum mechan- 
ics (38.1.2). In the first place, quantum mechanics was required 
to explain why helium is the only nonfreezing liquid at ultra- 
low temperatures and standard pressure. The zero-point vibra- 
tions (38.5.6) of the light-weight atoms of helium are sufficient- 
ly intensive and do not allow the weak forces of attraction 
between the helium atoms to form a crystalline structure at 
ordinary pressures. : 
The present-day theory of superfluidity is based on a study of 
the energy spectrum of helium at ultralow temperatures. Under 
these conditions the continuous energy spectrum can be regarded 
as the totality of quasi-particles with the energies hv;, where h 
is Planck’s constant and v; is the phonon frequency (41.7.5) 
corresponding to these quasi-particles. The quasi-particles 
cannot be identified with the real atoms of helium. They repre- 
sent the motion of the whole system of helium atoms, and the 
energy values of the quasi-particle describe the energy spectrum 
of the whole quantum system, i.e. the liquid helium as a whole. 
At low temperatures the excited state of the helium constitutes 
sound waves that are quasi-particles in the normal part of he- 
lium II. Associated with the quasi-particles are the store of 
internal energy and the presence of friction in the liquid he- 
lium. But states are possible in helium IIt hat correspond to its 
superfluid part, in which quasi-particles are not advantageous 
from the energy point of view and do not arise. As a result of 
the strong interaction between the particles of the superfluid 
part of helium II a bound body is formed in which no thermal 
excitations are generated: the superfluid part of helium II has 
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no store of internal cnergy and is not subject to viscosity. At 
absolute zero temperature, there should actually not be any 
normal part in helium IJ; all of the helium should be superfluid. 
As the helium is heated, the number of phonons increases as 
does the share of the normal part of helium II. But until the 
temperature reaches 2.19 K, a superfluid part of helium II, 
with its characteristic properties, still remains. At the temper- 
ature 2.49 K, helium II is converted into helium I and all the 
special properties of helium II disappear. 


CHAPTER 13 LIQUIDS 


13.1 Certain Properties of Liquids 


13.1.1 Liquids are bodies that have a definite volume, but assume 
the shape of the vessel that holds them. On the thermal mo- 
tion in liquids, see Sect. 8.1.5. The nature of the thermal mo- 
tion in liquids determines the similarity of their properties with 
those of both solids and gases. Like solids, liquids are almost 
incompressible. This property is associated with the strong 
intermolecular interaction of the particles in liquids. When liq- 
uids are compressed, the distances between the molecules are 
reduced and the repulsive forces that impede compression in- 
crease drastically (12.1.4). Liquids have relatively high densities 
and, like solids, resist, not only compression, but tension as 
well (40.3.6). This is ‘manifested in the fact that the Van der 
Waals isothermals enter the region of negative pressures (the 
isothermal at T = T, in Fig. 12.5). The resemblance of the 
properties of liquids and real gases at high temperatures and 
low densities is observed, for instance, in that the surface ten- 
sion of liquids (13.4.4) and the specific heat of vaporization 
(13.6.4) are reduced as the temperature is raised. Besides, the 
densities of dry saturated vapour (12.3.2) and of boiling liquid 
(12.3.2) approach each other as the temperature is increased. 
13.1.2 The similarity between liquids and solids is confirmed 
by X-ray diffraction analysis (33.4.1). At temperatures close 
to that of solidification, the arrangement of the particles in 
liquids is similar to the ordered arrangement of particles that is 
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typical of solidified liquids. The mutual arrangement of neigh- 
bouring particles in liquids resembles the ordered arrangement 
of neighbouring particles in crystals. In liquids, however, this 
ordered arrangement is observed only within small volumes. At 
distances r > (3 to 4) d from a certain selected “central” mol- 
ecule, order is no longer evident (here d is the effective diameter 
of the molecule, see Sect. 8.4.1). Such order in the arrangement 
of the particles is called short-range order in liquids. 

13.1.3 X-ray patterns of liquids do not differ from those of 
polycrystalline bodies (33.4.5) consisting of extremely fine 
crystals (having linear dimensions of the order of 10-9 m) that 
are randomly oriented with respect to one another in so-called 
cybotaxic regions. Within the boundaries of these regions the 
particle distribution is ordered, but the kind of order changes 
from one cybotaxic region to another. In the course of time, 
intensive thermal motion at temperatures that are not too low 
rapidly alters the arrangement and structure of the cybotaxic 
regions. 


13.2 Frenkel’s Hole Theory of the Liquid State 


13.2.1 The most important parameter determining the structure 
and physical properties of liquids is the specific volume. In 
melting a crystalline body the specific volume increases only 
slightly, by approximately 10 per cent. Such an increase in the 
specific volume takes place in a solid subject to negative pres- 
sure equal to the ultimate strength (40.3.7) of the solid. This 
enables a liquid to be dealt with as a body whose integrity has 
been violated at various places. In melting crystalline bodies, 
their particles acquire higher mobility. This is due to the vital 
property of fluidity of liquids, as well as the violation of long- 
range order in crystals and the emergence of short-range order in 
liquids (43.1.2). Moreover, owing to the high mobility of the 
particles, microscopic gaps or microvoids, which shall be 
called holes, appear in liquids. The thermal motion in liquids 
causes the holes to disappear at random at certain places and to 
appear simultaneously at others. This is equivalent to chaotic 
movement of the holes. 

13.2.2 The hole theory of the structure of liquids is inapplicable 
to liquids subject to high external pressures of the order of 
thousands of atmospheres, at which the compressibility of 
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liquids is comparable to that of solids. At high temperatures, 
close to the critical one*, the structure and properties of a liq- 
uid are close to those of a gas for which the concept of holes 
is meaningless. Hence, the hole theory is inapplicable for 
liquids under these conditions. 

13.2.3 It follows from the nature of thermal motion in liquids 
(8.1.5) that the molecules vibrate about a certain equilibrium 
position for the length of time t, after which this equilibrium 
position is displaced with a jump over a distance of an order of 


magnitude equal to the average distance (d) between adjacent 
molecules: 
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where n, is the number of molecules in unit volume, N4 is 
Avogadro’s number (Appendix II), p is the density of the liquid, 
and wis its molar mass. For water, for example, p = 10° kg/m3, 
u = 0.018 kg/mol and (d) ~ 3 x 10- m. 
13.2.4 The relaxation time is the average time (t) that a mol- 
ecule of the liquid remains in a temporarily settled state close to 
a certain equilibrium position. As the temperature is raised 
(t) decreases rapidly (43.2.5). This explains the high mobility 
of molecules in a liquid at high temperatures and the low vis- 
cosity of liquids under these conditions. 
13.2.5 The expenditure of a certain amount of activation energy 
i’ is required for a molecule to pass from one equilibrium posi- 
lion to another (13.2.4). Such a passage is regarded as overcom- 
ing a potential barrier of height E (38.7.1) because to accom- 
plish the passage the potential energy of the molecule should 
increase by the amount Æ. Only after this can it pass over to a 
new equilibrium position. The whole process turns out to be 
possible because high energy is concentrated on some of the 
molecules as a result of collisions in thermal motion. It is 
transferred to them by other molecules. The dependence of the 
relaxation time (13.2.4) on E and the absolute temperature is 
of the form 


E/RkT 
? 


(T) = Tye 


* Substances with an exceptionally low critical temperature are 
not considered here. 
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where k is Boltzmann’s constant (8.4.5) and tọ is the average 
period of vibrations of the molecule about its equilibrium posi- 
tion. 

13.2.6 When a liquid is subject to the action of an external 
force for the length of time ¢ >> (t) the particles of the liquid 
are displaced mainly in the direction of this force. This reveals 
the fluidity of the liquid. But if t < (t) a particle has no oppor- 
tunity to change its equilibrium position during the time of 
force action. At this the liquid manifests elastic properties, 
resisting changes in both volume and shape. 

During the time (t) a particle of liquid is displaced, on an av- 
erage, over the distance (d), the average velocity (v) of dis- 
placement of the molecules is determined by the equation 


The average velocity of motion of molecules in liquids is quite 
considerable, as a rule, but an order of magnitude less than the 
average velocities of molecules of the vapour of the same sub- 
stance at the same temperatures. 


13.3 Diffusion and Viscosity Phenomena in Liquids 


13.3.4 If conditions required for transport phenomena (10.8.1) 
arise, diffusion, heat conduction and viscosity are observed in 
liquids. The difference between transport phenomena in liquids 
and similar phenomena in gases affects the transport coefficients 
ae Y dependence on the characteristics of properties of the 
iquids. 

13.3.2 For a chemically homogeneous liquid the diffusion coef- 
ficient D (10.8.3) is calculated by the equation 


D = 4 4 -ERT 
6 To 


The quantities are defined in Sects. 13.2.3, 13.2.4 and 13.2.5. 
The diffusion coefficient grows rapidly with temperature. This 
is due, for the most part, to the sharp drop in the relaxation 
time (t) (13.2.4). Besides, the quantity (d) (13.2.3) increases 
somewhat with the temperature T. 
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13.3.3 When the temperature approaches the critical value 
(12.3.4), the average velocity (v) of the particles of the liquid 
approaches the average velocity of molecules of a real gas and 
the values of the diffusion coefficient D for liquids become close 
to those for gases. 

At temperatures much lower than the critical one, the diffusion 
coefficients for liquids are extremely small compared to those 
lor the corresponding vapours or gases at ordinary pressures. 
lor example, D ~ 1.5 X 10-9 m?/s for water at T = 300 K, 
while for the vapour of water (steam) in air at the same temper- 
alure and at standard atmospheric pressure D ~ 2 x 10- m?/s. 
13.3.4 At temperatures close to the critical one*, the thermal 
motion in liquids acquires features that differ from those de- 
scribed in Sect. 8.3.5 and approach those of thermal motion in 
gases. Under these conditions the internal friction (viscosity) in 
liquids is of the same nature as in gases (10.8.4). 

At temperatures close to the melting point, the viscosity of 
liquids cannot be accounted for in the same way as for gases. 
The mechanism by means of which viscosity arises is a complex 
one. The coefficient of dynamic viscosity n (10.8.4) for liquids 
may be related to the mobility ug of the molecule. This is defined 
as the velocity u gained by a molecule subject to an external 
force F equal to unity: u, = u/F. The relation between n and 
Ug is an inversely proportional one: n & up’. In its turn uy x 
x D/kT, where D is the diffusion coefficient, T is the absolute 
(emperature, and k is Boltzmann’s constant (8.4.5). Consequent- 
ly, no T/D or n œ T exp (E/kT), where £ is the activation 
energy (13.2.5). As the temperature is raised the viscosity of 
liquids drops rapidly, especially in the region of low tempera- 
(ures. At high pressures the viscosity of liquids increases rapidly 
with pressure. This is due to the increase in the activation en- 
ergy (13.2.5) and the corresponding increase in the relaxation 
lime (13.2.4). 


13.4 Surface Tension of Liquids 


13.4.1 A molecule of a liquid in the surface layer is subject to 
uncompensated, inwardly directed forces of attraction exerted 
by the rest of the liquid. As a result, the surface layer exerts a 
* See the footnote on page 209. 
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high internal pressure on the whole liquid. It is of the order of 
tens of thousands of atmospheres. 

13.4.2 Particles of the surface layer of a liquid have a higher 
potential energy than those inside the liquid. The reason for 
this is that in order to emerge from inside the liquid to its 
surface the molecules must do work to overcome the inwardly 
directed forces of internal pressure (13.4.1). This work increases 
the potential energy of the molecules that reach the surface. 
The work W that must be performed to increase the area of the 
surface layer of a liquid isothermally is equal to 


W = ((Fs—Fy)) N, 


where ((Fs — Fy)) is the average difference between the free 
energy (11.4.5) per molecule at the surface Fg and within the 
bulk Fy of the liquid, and N is the number of molecules in the 
surface layer of the liquid. 
13.4.3 The condition of minimum potential energy of a liquid, 
required to maintain its stable equilibrium, is complied with 
when the liquid has its minimum possible free surface area. The 
state of stable equilibrium of a liquid incompressible body cor- 
responds to the minimum ratio of its surface area to its volume. 
Therefore, small drops of liquid suspended in air are of spheri- 
cal shape. A liquid tends to reduce the area of its free surface. 
Consequently, the surface layer resembles a stretched elastic 
film in which tensile forces act (see also Sect. 13.4.5). 
13.4.4 The work done in the isothermal formation of unit sur- 
face area is called the surface tension o of the given liquid at 
the boundary with another phase (12.3.3): 
W N 

o= smk) =(Fs— Fy) my 
where n; = N/A is the number of molecules per unit area of 
the surface layer. The surface tension is also calculated by the 
equation 
gest AF 

~ AA 
where AF is the change in the free energy of the surface layer, 
and AA is the change in the surface area. 
The surface tension o depends upon the chemical composition 
of the liquid and its temperature. It decreases when the tem- 
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perature is raised and vanishes at the critical temperature 
(12.3.1). The surface tension can be reduced by adding surface- 
aclive substances to the liquid. This is due to the fact that such 
substances are adsorbed on the surface layer of the liquid and 
reduce the free energy of this layer (14.4.5). 

13.4.5 There is an essential difference between the surface 
layer of a liquid and an elastic film (13.4.3). The surface ten- 
sion of liquids does not depend upon the surface area and tends 
to reduce this area to zero. The tension of an elastic film is pro- 
portional to its deformation and equals zero at a definite finite 
area of the film’s surface. 

This peculiarity in the properiies of liquid films is associated 
with the change in the number of molecules in the surface layer 
when these films are stretched (or compressed). But the average 
distances between the molecules and the forces of intermolecular 
interaction determined by these distances do not change. This 
is why the magnitude of the surface tension is independent of 
the area of the free surface of the liquid. 

13.4.6 If the surface of a liquid is bounded by a wetted per- 
imeter (13.5.1), then the quantity o is equal to the force acting 
on unit length of the wetted perimeter, perpendicular to this 
perimeter. This force lies in a plane tangent to the free surface 
of the liquid. i 


13.5 Wetting and Capillary Phenomena 


13.5.1 The free surface of a liquid, curved adjacent to the wall 
of the vessel holding the liquid, is called the meniscus. The line 
along which the meniscus contacts the wall is called the wetted 
erimeter. The meniscus is characterized by the contact angle 0 
between the wetted surface of the wall and the meniscus at their 
point of contact. When 0 < n/2 (Fig. 13.1a), the liquid is said 
to wet the wall (to be a wetting liquid); when ® > x/2 (Fig. 13.10) 
the liquid does not wet the wall and is a nonwetting liquid. 
Wetting (nonwetting) is said to be ideal when 0 = 0 (0 = n). 
Here the meniscus is of spherical shape and is concave or con- 
vex. The absence of both wetting and nonwetting corresponds to 
the condition 0 = m/2, in which case the liquid has a plane 
free surface. 

{3.5.2 The formation of a meniscus is due to the interaction of 
the molecules with one another and with the particles of the 
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rigid body (wall). Molecule A of the surface layer, located close 
to the wall of the vessel and having a sphere of molecular action 
(12.2.3) of radius Rn (range of molecular action) (Fig. 13.2), is 
subject to the resultant force of attraction exerted on it by all 
the other molecules of the liquid (F,) and by all the particles 
of the wall (F,). Force F, is perpendicular to the wall. This fol- 
lows from considerations of symmetry. The direction of force 
F, depends upon the shape of the meniscus and the position of 


(a) (4) 
Fig. 13.1 


molecule A with respect to the wall. For example, if the menis- 
cus is flat (13.5.1) and molecule A is right at the wall, force F, 
is directed at an angle of 45° to the wall (Fig. 13.3). Molecule A 
can be in equilibrium only when the resultant force F = F, + 
+ F, is perpendicular to the surface of the liquid.* Otherwise, 
molecule A would move along this surface. 

13.5.3 The shape of the meniscus is determined by the three 
possible directions of force F: 

(a) force F is parallel to the surface of the wall, the liquid has 
a plane surface and 0 = n/2 (Fig. 13.4a); 

(b) force F is directed towards the wall; the forces of attraction 
exerted by the wall on molecule A exceed the forces of attrac- 
tion exerted on the molecule by all the other molecules of the 


* Here we ignore the gravity force acting on molecule A because 
it is negligibly small compared to forces F} and Fy. 
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liquid. The liquid has a concave meniscus with 6 < 1/2, i.e. 
the liquid wets the wall (Fig. 13.4.0); 

(c) force F is directed towards the liquid; the forces of attrac- 
lion exerted by the molecules of the liquid on molecule A 


Fig. 13.2 Fig. 13.3 


prevail over those exerted on it by the particles of the wall. The 
meniscus is convex with 0 > 7/2, i.e. the liquid does not wet 
the wall (Fig. 13.4c). 

13.5.4 The curved shape of the surface layer is the cause of the 
pressure Ap, additional to the external pressure, acting on the 


Fig. 13.4 


liquid. This additional pressure is due to the surface tension. 
This is similar to the way in which a stretched elastic envelope 
(like a rubber balloon) exerts pressure on the gas it contains, 
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The additional pressure exerted on a liquid with a surface layer 
of arbitrary shape is calculated by the equation 


1 1 
= ae E 
Ap: o R, | R, E 


where o is the surface tension (13.4.4), and R, and R, are the 
radii of curvature of any two mutually perpendicular normal 
sections* of the surface of the liquid at the point where mol- 
ecule A is located. The radius of curvature R, (or R>) is taken 
positive if its centre in the corresponding section is inside the 
liquid. Otherwise, the radius of curvature is taken to be nega- 
tive. Thus, Ap > 0 when the meniscus is convex and Ap < 0 
when it is concave. In the case of a plane surface R; = R, = 
= œ, and there is no additional pressure (Ap = 0). The pre- 
ceding equation is called the Laplace theorem. For a spherical 
surface R = Ra = R and Ap = 20/R. Such a surplus pressure 
exists, for instance, within a bubble of gas of radius R, located 
inside a liquid near the surface. 

The surplus pressure inside a soap bubble of radius R is due to 
the action of both superficial layers of the thin spherical film 
of soap (inside and outside the bubble). Thus Ap = 4o/R. 
13.5.5 The level of a liquid in narrow cylindrical vessels (capil- 
lary tubes) of radius r differs from the level of the liquid in a 
larger vessel into which the capillary tube is placed vertically. 
The level in the capillary tube is higher (lower) than in the lar- 
ger vessel by the amount h if the liquid wets (does not wet) the 
walls of the vessel. Here 


= 20 cos 8 
rgp 


where @ is the contact angle (13.5.1), p is the density of the liq- 
uid, and g is the acceleration due to gravity (7.3.3). This is 
called Jurin’s rule. 

If the liquid is in a narrow slit of constant width ô between two 
parallel plates, the meniscus is of cylindrical shape having 


* A normal section of the surface through point A is the curve 
obtained as the intersection of the surface and a plane passed 
through the normal to the surface at this point. Radii R, and R, 
are called the principal radii of curvature of the surface layer. 
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the radius 6/2 and the level in the slit is above (for a wetting 
liquid) or below (for a nonwetting liquid) the level in the vessel 
in which the plates are immersed by the amount 


20 cos 8 
5p 
13.5.6 The pressure of saturated vapour (12.3.2) above the 
curved surface of a liquid depends upon the shape of the menis- 
cus. If the meniscus is concave (convex), the pressure of the 


saturated vapour is less (greater) than that above a plane sur- 
face by the amount Apps: 


lh = 


p 4 
APms = ———— AP, 
ms 01—P 
where p is the density of the saturated vapour, p; is the density 
of the liquid, and Ap is the additional pressure due to the cur- 
vature of the surface (13.5.4). 


13.6 Vaporization and Boiling of Liquids 


13.6.1 Vaporization, or evaporation, is the process of vapour 
formation at the free surface of a liquid. It takes place at any 
temperature and increases as the temperature is raised. There 
are molecules in, the surface layer of a liquid that have high 
velocities and kinetic energies of thermal motion. The escape 
of such molecules from the surface into the space above the liq- 
uid is the explanation for evaporation with the consequent 
reduction of the store of internal energy and cooling of the liq- 
uid. A measure of the process of vaporization is the rate of 
evaporation u, which is the amount of liquid converted into 
vapour in unit time. The rate u depends upon the external pres- 
sure and the motion of the gaseous phase above the free surface 
of the liquid. Thus 


v= (Py — P) 

= Po Vv ’ 
where c is a constant, A is the area of the free surface of the liq- 
uid, py is the pressure of saturated vapour, p is the pressure 
of the vapour above the free surface of the liquid being vapor- 
ized, and p, is the external barometric pressure. 
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13.6.2 Boiling is the process of intense vaporization of a liquid, 
not only from its free surface, but throughout the whole volume 
of the liquid within the bubbles of vapour that are produced 
in boiling. The bubbles of vapour in a boiling liquid rapidly 
increase in size, rise to the surface and burst. This causes the 
turbulence in a boiling liquid. The pressure p within a bubble 
of vapour located within the liquid is the sum of the external 
pressure pọ, the hydrostatic pressure pı of the overlying layers 
of liquid, and the additional pressure Ap due to the surface 
tension (13.5.4). Thus 


P= Po + pı + Ap, 
and 


20 
Pi=egh and a 


where r is the radius of the vapour bubble, k is the distance 
from its centre to the surface of the liquid, and o and o are 
the density and surface tension of the liquid. 

The boiling of a liquid begins at the temperature at which the 
pressure py of the saturated vapour inside a bubble is at least 
as high as the pressure p (13.6.2): 


20 
v = Pot Pgh + as 


If this condition is not complied with, the bubble collapses and 
the vapour it contains is condensed. 

13.6.3 With small radii r of the vapour bubbles, the pressure py 
should be quite high and it is necessary to heat the liquid toa 
high temperature before it starts boiling. If the liquid contains 
centres of vaporization (dust specks, bubbles of dissolved gases, 
etc.), boiling begins at a much lower temperature. This is due 
to the fact that the vapour bubbles that appear on the centres 
of vaporization are of such a size that the effect of the third 
member in the inequality of Sect. 13.6.2 can be neglected. More- 
over, pgh < py as a rule, so that the approximate condition for 
the beginning of boiling can be written in the form z 


Py © Po 
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The temperature of a liquid at which the pressure of its saturat- 
ed vapour is equal to the external pressure is called the boil- 
ing point of the liquid. 
13.6.4 When a liquid boils at constant pressure, its temper- 
alure remains constant. The quantity of heat transferred to a 
boiling liquid is completely expended in converting the mole- 
cules of the liquid into vapour. The heat rp required to vapor- 
ize unit mass of a liquid heated to the boiling point is called 
the heat of vaporization. The value of rp decreases as the boil- 
ing point is raised and vanishes at the critical temperature 
(12.3.4). 
The change in the internal energy of a liquid (9.1.1) as a unit 
of its mass is converted into vapour at the boiling point is 
called the internal heat of vaporization. | 
13.6.5 The boiling of liquids and the condensation of vapour are 
examples of first-order phase transitions (cf. Sect. 12.4.2). Typ- 
ical of such phase transitions are the simultaneous constant 
values of pressure and temperature, but the changing ratio of 
the masses of the two phases (12.3.3). For a first-order phase 
transition to occur, the heat rp (specifie latent heat of vaporiza- 
tion) must be supplied to or taken away from the system. The 
heat rp per unit mass is calculated by the Clausius-Clapeyron 
equation: 

r 2 
Th = (ve Vy) aT ? 
where v, and v, are the specific volumes of the substance in the 
initial and final phases, T and p are the temperature and pres- 
sure of the phase transition. 
13.6.6 It follows from the Clausius-Clapeyron equation for the 
boiling of liquids that 


da? (vy —v}) T 
dp rh i 


where vı and vy are the specific volumes of the liquid and vapour 
at the boiling point T. Since vy > vı and rp > 0, dT/dp > 0, 
i.e. the boiling point increases with pressure. For instance, at 
the pressure p = 1.25 X 107 Pa water can be heated without 
boiling to a temperature at which lead can be melted in the 
water. The melting point of lead is 900 K, 


PART THREE 
ELECTRODYNAMICS 


CHAPTER 14 ELECTRIC CHARGES. 
COULOMB'S LAW 


14.1 Introduction 


14.1.1 Electrostatics is the branch of the study of electricity 
that deals with the interaction and properties of a system of 
electric charges that are fixed with respect to the chosen inertial 
frame of reference (2.1.2). 

There are two kinds of electric charges, positive and negative. 
Bodies having unlike charges attract one another; those having 
like charges repulse one another. 

14.1.2 The electric charge of any system of bodies consists of a 
whole number of elementary charges, each equal to 1.6 X 
x 10-19 C (where C is the abbreviation of coulomb, the unit of 
charge). The stable particle with the minimum rest mass (5.6.1) 
and a negative elementary charge is the electron. The rest mass 
of the electron is 9.1 X 10-%! kg. The stable particle with the 
minimum rest mass and a positive elementary charge is the 
positron (43.5.1).* Its rest mass is the same as that of the elec- 
tron. Besides, there is a stable particle with a positive elemen- 
tary charge called the proton. The rest mass of the proton is 
1.67 X 10-2? kg. Electrons and protons are found in the atoms 
of all the chemical elements. 

14.1.3 The law of conservation of electric charge states: the 
algebraic sum of the electric charges of bodies or particles that 
constitute an electrically isolated system remains constant re- 
gardless of any processes that occur in the system. 

New electrically charged particles may be formed in the system 
being considered. These may be electrons formed by the ion- 
ization of atoms or molecules (22.4.1), ions formed by the phe- 


* Here we do not consider the instability of the positron, com- 
mon in our world, that is associated with the annihilation of 
an electron-positron pair (43.5.1). 
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nomenon of clectrolytic dissociation, etc. If the system is elec- 
trically isolated, however, then the algebraic sum of the charges 
of all the newly formed particles of the system always equals 
zero. The law of the conservation of electric charge is one of 
the fundamental conservation laws, like the laws of conservation 
of momentum (2.7.1) and of energy (3.4.2). 

14.1.4 Upon contact of two electrically neutral bodies, charges 
go over from one body to the other as a result of friction. In 
each of such two bodies there is no longer an equal number of 
positive and negative charges and the bodies become oppo- 
sitely charged. 

When a body is electrized by induction, the charges in it are no 
longer uniformly distributed. The charges are redistributed so 
that one part of the body has an excess of positive charges and 
the other part, of negative charges. If these two parts of the 
body are separated in some way, they will be oppositely charged. 


14.2 Coulomb’s Law 


14.2.1 Coulomb experimentally established that the force of 
interaction Fi between two small charged spheres, with charges 
equal to q, and q, (14.1.1), is proportional to the product q1q2 
and inversely proportional to the square of the distance r be- 
tween the spheres: 


where k, is the proportionality factor (k; > 0). 
14.2.2 Central forces act on the charges along a straight line 
joining the centres of the charges. In the interaction of like 
charges the product 9,9, > 0 and Fia > 0. This corresponds to 
mutual repulsion. For unlike charges qiqa < 0 and Fy, < 0, 
which corresponds to mutual attraction. The vector form of the 
Coulomb law is 

r 
Fio=hy Ai Ea ’ 


where F, is the force exerted on charge q, by charge q3, ry is 
the radius vector joining charge gq, to qı, and r = |r; 
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(Fig. 14.42). The force Fa, exerted on charge q by charge q, 
equals 


9192 Vor 
Foi es kı r2 r ? 
where r, = —r;s is the radius vector joining charges q, and q3 


(Fig. 14.1b). Forces F; and F.; are said to be Coulomb forces. 
14.2.3 Coulomb’s law is valid for the interaction of point elec- 
tric charges that are fixed in the given inertial frame of refer- 
ence. Point charges are charged 
bodies whose linear dimensions 
(a) Fae, "2 o dare very much smaller than, 
s0 >Q the distance r between them 
A J3 
(da & r). 
Coulomb’s law states: the force 
of electrostatic interaction be- 
ro Fo, tween two point electric charges 
(6) o——____—_——_ >_> is proportional to the product of 
g,>0 g2>0 the magnitudes ofthe charges, 
inversely proportional to the 
Fio"14.4 square of the distance between 
a them and acts along the straight 
line joining them. 
In the form given in Sect. 14.2.2 Coulomb’s law is also valid 
for the interaction of charged spheres of radii R, and R, if the 
charges q, and q of the spheres are uniformly distributed over 
their surfaces or throughout their volumes. Radii R, and R, 
may be commensurable with the distance r between the centres 
of the spheres. 
14.2.4 A Coulomb force (14.2.2) depends upon the properties of 
the medium in which the interacting charges are located. This 
dependence can be taken into consideration if we consider the 
factor kı in the formulas of Sects. 14.2.1 and 14.2.2 as the ratio 
of two factors: 


k 


: er 

where k is a factor depending only on the choice of the system 
of units of measurement and grp is the dimensionless relative 
permittivity, or dielectric constant, of the medium and character- 
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izes its electric properties. It is assumed here that the medium 
is boundless, homogeneous and isotropic, i.e. that its properties 
are the same throughout its volume and are independent of the 
direction. For a vacuum, €r is considered to equal unity. 
'14.2.5 For a medium with the relative dielectric constant ¢,, 
Coulomb’s law takes the form 


F=k dil, . 
Err? 


Vor charges q, and q located in vacuum (2, = 1), 


» — 1 9192 
maar 


From this follows the meaning of the relative dielectric con- 
stant 


t. = F,/F. 


14.2.6 The concept of €,, introduced in Sects. 14.2.4 and 14.2.5, 
has a meaning only under the conditions stipulated in Scct. 
14.2.4. An e,-fold reduction of F is due to a phenomenon 
known as electrostriction, i.e. the deformation of a dielectric 
(18.1.1) from the effect of charged bodies. In their deformation, 
liquid and gaseous dielectrics are directly adjacent to the charged 
bodies and exert an additional mechanical action on them, 
reducing the Coulomb force (14.2.2). In solid dielectrics, charges 
qı and qa on the bodies may be located within certain cavities. 
This makes the calculation of forces F; and F,, (14.2.2) con- 
siderably more difficult. These forces depend on the shapes of 
the cavities in which the charges q, and q are located. In this 
case the interpretation of g, given in Sect. 14.2.5 cannot be 
correct. 

14.2.7 In SI units (Appendix I) the factor k in the formulas of 
Sect. 14.2.5 are taken equal to 


— 1 
«ANE, 


and Coulomhb’s law is written in the form 


7 1 4192 
ANE, Err? : 
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This form of Coulomb’s law and of all laws following from it is 
said to be rationalized. 

The quantity £ is called the permittivity of free space or, accord- 
ing to the SI units, the electric constant. It is equal to 8.85 x 
x 10-12 C2/N-m? (Appendix II). 

The product €)€, is sometimes called the absolute dielectric con- 
stant, or absolute permittivity of the medium. 

In the cgse system of units (Appendix I), Coulomb’s law is 
written in the following irrational form: 


_ 4192 
Err? i 


CHAPTER 15 ELECTRIC FIELD STRENGTH 
AND DISPLACEMENT 


15.1 Electric Field. Field Strength 


15.1.1 A force field is a form of matter that is studied in physics 
along with its study of other forms of matter known common- 
ly as substances. The most basic feature of force fields is that 
various kinds of interaction are accomplished by means of 
them. A gravitational field (6.2.1), for instance, accomplishes 
gravitational interaction between bodies of various masses 
located in the field. All force fields have vital properties that 
characterize their material nature. Primarily this concerns their 
energy. There are no impassible barriers between a field and 
material substances; one can be converted into the other 
43.5.5). 

Wai field cannot be defined as space in which certain 
forces act. Space, like time, is a form of the existence of matter. 
The fact that a field exists in space by no means infers that the 
field can be identifed with space, because the form of existence 
of matter cannot be confused with the matter itself. 

15.1.2 Coulomb interaction (14.2.2) between fixed electrically 
charged particles or bodies is accomplished through the elec- 
trostatic field set up by the charges. An electrostatic feld is a 
steady-state electric field, i.e. one that does not change in the 
course of time, and is set up by fixed electric charges. This is 
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mie of the forms of an electromagnetic field by means of which 
interaction is accomplished between electrically charged par- 
ticles (or bodies) that travel, in the general case, in some arbi- 
trary way with respect to the chosen frame of reference. 

A typical property of an arbitrary electric field, distinguishing 
it from all other physical fields, is the action it exerts on both 
travelling and fixed electric charges (charged particles and 
bodies). 

15.1 S The quantitative force characteristic of the action exert- 
ed by an electric field on charged particles and bodies is the 
vector E of the electric field strength (formerly called the elec- 
tric intensity). The electric field strength at a given point is 
(he ratio of the force F, exerted by the field on a fixed point 
(14.2.3) electric test charge placed at the point of the field being 
considered, to the magnitude qa of the charge: 


F 


3 
D EN 


lo ` 


It is assumed that the test charge qo is so small that its pres- 
ence does not lead to a redistribution in space of the charges 
that set up the field being investigated. In other words, the 
lest charge does not distort the field. 

An electric field is said to be uniform (a uniform electric field) if 
the electric field. vector E has a constant magnitude and direc- 
Lion at any point of the field. 

15.1.4 The strength of the electrostatic field of point charge q 
at a point located at the distance r from the charge is 


VS 1 q be . 

a Err? r (in SI units) 

EREE (in cgse units) 
err? 


where r is the radius vector joining charge q and the point at 
which the field strength is to be calculated. Vectors E are direct- 
ed at all points of the field radially away from charge q when 
qœ 0, and directed radially toward the charge when q < 0. 


15--0186 
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The projection Æ, of the field strength on the direction of radius 
vector r is 


E E 


r I —— 
ÁNEg Err? 


E,= (in cgse units). 


Err? 
These formulas are also used to calculate the strength of the 
feld set up by a hollow sphere, having the charge q on its sur- 
face, at the distance r from the centre of the sphere whose radius 
equals R. Inside the hollow sphere Er = 0. 

15.1.5 The force F exerted by an electric field on an arbitrary 
charge q located at a given point in the field is 

F = gE, 

where E is the field strength at the point where charge q is lo- 
cated and is distorting the field, i.e. making it different from 
the field set up before introducing charge q. 

15.1.6 Electrostatic fields are represented graphically by employ- 
ing lines of force. 

Lines of force are imaginary curves drawn in such a way that 
their direction at any point (i.e. the direction of the tangent to 
the line at the point) coincides with that of the electric field 
vector. Lines of force are assumed to have the same direction 
as the corresponding electric field vectors. Lines of force do not 
intersect because at each point of the field vector E has only a 
single direction. 

Lines of force are not identical with the paths of motion of 
light charged particles in an electrostatic field. Tangent at each 
point of the path of a particle is its velocity. Tangent to a line 
of force is the force acting on a charged particle. This, conse- 
quently, corresponds to the acceleration of the particle. 


15.2 Principle of Superposition of Electric Fields 


15.2.1 The basic problem of electrostatics can be formulated as 
follows: find the magnitude and direction of the electric field 
vector E for each point of a field from the given space distribu- 
tion and magnitudes of the electric charges that are the source 
of the field. 
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15.2.2 When the field is set up by a system of fixed charges 
is Ja, 13, +++) Qn, the resultant force F exerted on a test charge 
dq (15.1.3) at any point of the field being considered is equal to 
the vector sum of the forces F; exerted on charge qọ by the 
fields set up by each of the charges q;: 

i=n 
PS S 

i=1 


According to Sect. 15.1.3, F = qE and F; = q,E;, where E 
is the strength of the resultant field, and E; is the strength of 
the field set up by charge q;. Following from the preceding for- 
mulas is the principle of independent action of electric fields, or 
the principle of superposition of electric fields: 


n 
E= 5) Ej. 
i=l 


The strength of an electric field of a system of charges is equal 
to the vector sum of the strengths of the fields set up by each 
charge separately. The strength of the resultant field is found 
by superposition of the strengths of the fields of the separate 
charges. 

For charges distributed continuously in space (15.2.3), the 
principle of superposition of fields is of the form 


where integration extends over all the continuously distributed 
sources of the fields, setting up electric fields with the strength 
dE. 


Example. A system of fixed point charges qi, qo, .--, dn 
sets up an electrostatic field whose strength E equals 
1 n 
ous i eee ; 
L inte D FE r; (in SI units) 
1= 
n 
= » T. r; (in cgse units) 
i=1 
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where r; is the radius vector from point charge q; to the point 
being investigated in the field. Any charged body on which 
the charge Q is discretely distributed (15.2.3) can be divided into 
extremely small parts, each having a point charge. Therefore, 
the preceding formula is of general significance forcalculating 
electrostatic fields in a homogeneous, isotropicmedium that 
fills the whole field. 

15.2.3 Electric charges, the sources of electrostatic fields, can 
be distributed in space either discretely (discrete charge distri- 
bution) at various points in space, or continuously (continuous 
charge distribution). In the latter case, the charges are distrib- 
uted either along a certain line, on the surface of some body 
or within a certain volume. The concept of charge density is 
introduced for continuous charge distribution. When the elec- 
tric charge is distributed continuously along a line we have 
linear charge density T: 


_ 24 
=a 
where dq is the charge of an infinitesimal clement of length dl. 


When the electric charge is distributed continuously over 
a certain surface, we speak of the surface charge density o: 


T 


d 
(0) ee g 
dS 
where dq is the charge on an infinitesimal element of arca dS. 
When the electric charge is distributed continuously throughout 
some volume, we speak of the volume charge density p: 


P= ay ? 
where dq is the charge in an infinitesimal element of volume 


Example. The electrostatic field of an electric dipole. An elec- 
tric dipole is a system of two charges of equal magnitude, but 
of opposite sign. These charges, g and —g (q > 0) are separated 
by the distance J which is small in comparison with the distance 
to the points being considered in the field. The arm of the dipole 
ts the vector l, directed along the dipole axis from the negative 
ioward the positive charge and equal to the distance between 
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them (Fig. 15.1). The product of the charge q of the dipole 
(q > 0) by the arm l is called the electric dipole moment pe: 


Pe = ql. 


The strength E of the field of an electric dipole at an arbitrary 
point is: 


E=E,+E., 


where E, and E_ are the strengths of the fields set up by the 
charges q and —q (Fig. 15.14). 


z t I Pe EA E E. 


Fig. 15.1 


At point A, located on the dipole axis at the distance r from 
its middle (Fig. 15.1) (r >> l), the strength of the field of a 
dipole equals: 


__ 1 pe 
«ANG, Err? 


E 


(in SI units), 


4 


2 . ; 
T (in cgse units). 
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At point B, located on a perpendicular erected at the middle 
of the dipole axis and at the distance r from the middle of the 
axis (r > l) 


N 1 Pe |. 
E=— ine, er (in SI units), 
E———e (in cgse units). 


Err’ 


At the arbitrary point O, sufficiently distant from the dipole 


Fig. 15.2 


(r >> l) (Fig. 15.2), the magnitude of its field strength is: 


e. 1 Pe 
Aney Err? 


V 3 cos? 0+1 (in SI units), 


E= ao V 3cos?0+1 (in cgse units). 
Err 


15.3 Electric Displacement. Ostrogradsky-Gauss 
Electric Flux Theorem 


15.3.1 The electric field strength (15.1.3) depends upon the prop- 
erties of the medium. In a homogeneous isotropic medium, the 
strength E is inversely proportional to ¢, (14.2.5). Another 
characteristic of an electric field, along with its strength E, 
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is the vector D of electric displacement, or electric induction 
(other names are dielectric displacement, dielectric flux density, 
displacement, electric displacement density and electric flux 
density). For a field in an electrically isotropic medium, the 
relation between D and E is of the form 


D = €,€rE (in SI units), 
D = e E (in cgse units). 
The general relation for D, valid for anisotropic media is given 


in Sect. 18.3.4. 
\xample. For the field of point electric charge q (14.2.3) 


1 q 
De at (in SI units), 
D=-4 r (in cgse units). 


The projection of D on the direction of radius vector r, from 
the point charge to the given point in the field, is 


1 


Ore 


> (in SI units), 
D, =-4- (in cgse units). 


15.3.2 An element d®, of electric displacement flux through an 
element of surface with the area dS is the scalar physical quan- 
tity defined by the equation 


d, = D dS = D dS cosa=D,dS =Dds,, 


where D is the displacement vector (15.3.1), n is a unit vector 
normal to area dS, dS = dS n is the vector of the element of 
surface dS, D, = D cos @ is the projection of vector D on the 
direction of vector n, dS | = dS cos æ is the area of the pro- 
jection of the element of surface dS on a plane perpendicular 
to vector D, and @ is the angle between vectors D and n 
(Fig. 15.3) 
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When the electric field is set up by point charge q, the element 
of displacement flux dD, through the element of sur‘ace dS 
of the closed surface S, within which the point charge is located, 
is 

ne A 
dDe= E dw, 


where dw is the solid angle subtended at point charge q by the 
element dS of surface S (Fig. 15.3). 


The total displacement flux ®, through the surface S is the sum- 
mation or integration of all the elements of flux: 


De = f D dS = | D dS cos a= | D, dS = | Das, 
S S S S 


where a is the angle between vectors D and n. Iere all the vec- 
tors and normals to the elements dS of surface are directed 
toward the same side of surface S. In the case of a closed sur- 
face S (Fig. 15.3), all the vectors n of the normals must be 
either outward or inward ones*. 


* Only outward normals are used in the following. 
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1.3.3 The Ostrogradsky-Gauss theorem states that the displace- 
ment flux passing through an arbitrary closed surface is equal 
lo or proportional to the algebraic sum of the electric charges 
‘iy l2, q3, ++ +, qp enclosed within the surface: 


k 
$ D dS= 9 Pr dS = > q; (in SI units), 


S i= į 
k 


$ D dS = 9 Dy dS =4n 5 qi (in cgse units). 
S i=1 


The displacement flux passing through an arbitrary closed sur- 
face that does not enclose any electric charges equals zero. 
The differential form of the Ostrogradsky-Gauss theorem is 
one of Maxwell’s equations for an electromagnetic field (27.4.2.) 
v15.3.4 The Ostrogradsky-Gauss theorem for the electrostatic 
field in vacuum states that vector flux of electrostatic field 
strength in vacuum passing through an arbitrary closed surface 
is proportional to the algebraic sum of the electric charges 
enclosed ee the surface: 


pr dS= i5 qi [in SI units, where gẹ is the electric 
i=1 constant (14.2.7)], 
k 

tg dS = 4n 3 qi (in cgse units). 
i=1 


The Ostrogradsky-Gauss theorem for the field in a dielectric is 
liscussed in Sect. 18.3.3. 

15.3.5 Together with the principle of superposition of fields 
(15.2.2), the Ostrogradsky-Gauss theorem is applied to calculate 
electric field vectors D. This is done by choosing a closed sur- 
face in such a way that in the expression for the displacement 
flux, the quantity D can be placed outside the sign of the sur- 
face integral. This can be done for fields set up by the sim- 
plest symmetrically arranged charges (charged lines, plane, 
sphere, etc.). 
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CHAPTER 16 ELECTRIC FIELD POTENTIAL 


16.1 Work Done in Moving an Electric Charge in an 
Electrostatic Field 


16.1.1 The work done in moving an electric charge q’ in an 
electrostatic field of strength E (15.1.3) is independent of the 
shape of the path along which the charge is moved; it depends 
only upon the initial and final positions of the charge. In other 
words, electrostatic forces, like gravitational forces, are poten- 
tial (conservative) forces (3.1.6). The work ôW done by the 
force F = q’E exerted on charge q’ in moving it over the dis- 
tance dl, is equal to: 


ôW = F dl cos ọ = g'E cosa dl, 


where œ is the angle between the directions of vectors E and 
dl, and @ is the angle between the directions of the vectors F 
and dl. The work done in a finite movement of charge q’ from 
point a to point b is equal to 


b b 
W=q \ E cosadl=q’ \ E dl, 
a a 


where E dl is the scalar product of vectors E and dl. 
46.1.2 If the electric field has been set up by charge g, then 


eee soe 2 
= Free.’ dl cosa=—dr, 

and the work done in the movement of charge q’ from point a 
to point b in this field is equal to 


T2 
Waen \ T= (E) 
ANEgEr r? — 4NEočA T TaJ?’ 
ry 
where r; and r, are the distances of points a and b from charge q 
(Fig. 16.4). 
The work W done by the electric forces of repulsion of like 
charges q and q’ is positive when the-charges move away from 
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cach other, and negative when they move toward each other. 
The work done by electric forces of attraction of unlike charges 
q and q’ is positive when they move toward each other and 
negative when they move away from each other. 

A6.1.3 When an electric charge q’ is moved in a field set up by 
a system of point charges q4, q2, .--, Gn, the charge q’ is sub- 
ject to the force 


F=F,+F,+...+Fn 


and the work W done by the resultant force is equal to the 
algebraic sum of the work done by the component forces: 


n 
'qi 1 1 
W=W: +... Wn = X z — E 
i=1 


WE gE r Tia Tio 


where r; and r;. are the distances of charge q; to the initial 
and final positions of charge q’. Each of the amounts of work W; 
and the total work W depend / 

upon the initial and final posi- 
tions of the charge q’, but are 
independent of the shape of 
its path. 

16.1.4 Circulation of the field 
strength vector along the closed 
path or loop ZL is the line in- 
tegral 


) E dt cos a= Edl. 
L L 


This integral is numerically equal to the work done by elec- 
trostatic forces in moving a unit positive charge along a closed 
path. Since r;; = rj. for a closed path, we have, according to 


Sect. 16.1.3, È E dl = 0, i.e. the above-mentioned work 
L 


Fig. 16.4 


equals zero. 

A force field of strength E that complies with such a condition 
is said to be a conservative one. An electrostatic field is con- 
servative, 
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The differential form of the condition indicating the conservative 
nature of an electrostatic field is one of Maxwell’s equations 
for an electrostatic field (27.2.1). 


16.2 Potential of an Electrostatic Field 


16.2.1 The work ôW (16.1.1) is equal to the decrease dE, in 
potential energy (3.3.1) of the charge gq’ moving in an electro- 
static field: 


ôW = —dEy 
and 


b 
W = \ q' E di cos a= —AEp= Eip— Eep, 
a 


where £,, and Ep are the values of the potential energy of 
charge q’ at points a and b, and œ is the angle between the 
„directions of vectors E and dl (Fig. 16.1). 

16.2.2 When point charge q’ is moved by electrostatic forces 
in a field set up by charge gq, the change dE, in its potential 
energy in an infinitesimal displacement is 


EEE ee C EL 
Ao oe Ane tyr? ° 

Upon a finite displacement of charge q’ from point a to point b 
(Fig. 16.1), the change AZ, in the potential energy of the charge 
is 

r2 


j qq’ qq’ 
IT eT ert Fk BN I See lS 
ee i | Ep ANEErra ANEyErry oi a 
T1 units) 
4 
E == : paca i SE i (x . 
AEp=qq ( E? 7a ) (in cgse units) 


16.2.3 When point charge q’ is moved in a field set up by a 
system of point charges (qi, d2, ---, qn), the change in the 
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potential energy of charge q’ is 


nr 
AL 1 5 qi qi . i 
\E n= wt =) in SI units 
p 1 2 ( ÅNEgErr jo ANE Ey 74 ( p My 
i= | 
n 


t : di qi e . 
AEn = -4-2 | in cgse units 
p d 2; í Ertis Errii ( 8 )s 


i=1 


where r;,; and rj. are the distances between charges y; and q’ 
before and after the displacement of the latter. 

16.2.4 To find the absolute value of the potential energy of 
an electric charge at a given point of an electrostatic field, it 
is necessary to choose a reference point (3.3.4) for the potential 
energy. Integrating the cquation in Sect. 16.2.2, we obtain 
for the general case: 


Ul 


E ae ee 


P Ane pr 


where C is an arbitrary constant. If we assume that Ep = 0 
as r—> oo, then C = O and the potential energy of charge q’, 
located in the field of charge q at the distance r from this charge, 
equals 


a E ad ae 
Ep = NRE (in SI units), 


Ep = — (in cgse units). 
r 


For like charges q and q’, the potential energy of their inter- 
action (repulsion) is positive and increases (decreases) as the 
charges move closer together (away from each other). In attrac- 
pion of unlike charges Ep < 0 and increases to zero as one of 
the charges approaches infinity. The curves showing the de- 
dendence of the potential energy of two point charges on the 
tistance between them are given in Fig. 16.2. 
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16.2.5 The potential energy Æp of charge gq’, located in a field 
set up by a system of point “charges dı» 92, -+ +» gnis equal 
to the sum of the potential energies Z;, in the fields set up by 
each of the charges separately: 


n n 
San , qi . A 
Ey = `, ip = > condi 
p Eip=4 inaen, (in SI units), 


i=1 i=1 


where r; is the distance between charges q; ana g’. 


like charges 


\ Unlike charges 


Fig. 16.2 


16.2.6 The potential of an electrostatic field is an energy charac- 
teristic of the field, it is the ratio of the potential energy /, 
of charge q’ to the magnitude of the charge 


n 
a 
7 DE eT (in SI units). 


This ratio is independent of the magnitude of the charge q’ 
and is numerically equal to the potential energy of a unit test 
charge (15.1.3) placed af the point being considered in the 
field. 
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Examples: 
(1) The potential of an electrostatic field set up by one point 
charge q in a homogeneous and isotropic dielectric is 


V = ——* — (in SI units), 


ANE Err 


| 


q 
a (in cgse units). 


) The potential of a field set up by a charged conductor 
6.4.4) in a homogeneous and isotropic dielectric is 


= 
-g 


r= = 9 zas (in SI units), 


(in cgse units), 


where o is the surface charge density on the conductor (15.2.3) 
and dS is an element of surface of the conductor. 

(3) The potential of’an isolated conducting sphere of radius R, 
having the charge q, is 


= q f ; 
V = keh (in SI units), 
Roos | q s e 
yV = eR (in cgse units). 


16.2.7 The work W done by electric forces in moving charge q’ 
from point a to point b of an electrostatic feld is 


W = Eip — Eon = g' (Vi— Va) = g AV, 

where £,, and Ep are the potential energies of charge q’ at 
points a and b, V; and V, are the potentials of the field at the 
same points, and AV = V, — V, is the potential difference. 
If point b is at infinity, Ep = 0 and V, = 0. Then the work 
Woo required to move charge q’ from point a to infinity is 
equal to 


Woo = Exp = q Vi- 
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Owing to the arbitrary nature of point a subindex 1 can be 
discarded. Then 


Hence, the potential at a given point of an electrostatic field 
is qaumerically equal to the work done by electrostatic forces 
in moving a unit positive charge from the point in the field to 
infinity. This work is also numerically equal to that done by 
external forces (acting against the forces of the electrostatic 
field) in moving a unit positive charge from infinity to the given 
point in the field. 

16.2.8 In the study of electrostatic fields, all the problems require 
a knowledge of the potential difference between any points 
in a field rather than the absolute values of the potentials at 
these points. Therefore, the choice of a point of zero potential 
depends only on its convenience for solving the given problem. 
It frequently proves convenient to assume that the potential 
of the earth is equal to zero. 


16.3 Relation Between the Potential and Strength 
of an Electrostatic Field 


16.3.1 On the basis of the equations in Sects. 16.2.4 and 16.2.6, 
an element of work 6W done in the infinitesimal displacement 
of charge q’ in an electrostatic field is 


dW = qg'E cosa dl = —dE, = —g' dV, 
or 
E cosa dl = —dV; Edl = —dV. 


But dl cos œ = dl, where dl, is an element of length of the line 
of force (15.1.6) (Fig. 16.3). Hence, E = —dV/dl,. The deriv- 
ative dV/dl, is the rate of change of the potential along a line 
of force and is numerically equal to the change in potential 
per unit length of a line of force. 
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16.3.2 The projection Æ; of vector E on the direction of motion 
dl is equal to: E; = E cosa. Therefore 


dV 
dt ° 


It is obvious that E; < E and | dV/dl | < | dV/dl, |. Hence, 
E, and dV/dl reach their maximum values when dl is tangent 
to the line of force. 

In the vicinity of a given point in an electrostatic field, the 
potential varies most rapidly in the direction of a line of force. 
The sense of vector E is to- 
ward the most rapid decrease 
in potential. | 

16.3.3 In amore general form 
the relation between the strength 
and potential of an electro- 
static field is 


E = —grad V, 


E= — 


Fig. 16.3 


where grad V is the potential 
gradient vector, directed toward 
the most rapid increase in the potential and numerically equal 
to its rate of change per unit length in this direction. If the 
potential V is regarded as a function of the three Cartesian coor- 
dinates of the given point in the field, then 


ôV. OV ,, ôV 
grad V= —— rape! on k, 


where i, į and k are unit vectors along the OX, OY and OZ 
axes. The projections of the electrostatic field strength vector 
on the coordinate axes are related to the field potential by the 
equations: 

oV oV ôV 


Ex Ir Or? y= Ta and Lymer 

16.3.4 When an electric charge is moved in a direction dl, per- 
pendicular to a line of force, i.e. perpendicular to vector E, 
E, = 0 and dV/dl = 0 (16.3.1), i.e. V = const. The potential 
is the same at all points of a curve orthogonal to a line of force. 


16—0186 
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The geometrical locus of all points having the same potential 
is called an equipotential surface. It follows from the aforesaid 
= equipotential surfaces are everywhere orthogonal to lines 
of force. 

The work done in moving an electric charge along one and the 
same equipotential surface equals zero. 

An infinite number of equipotential surfaces can be constructed 
about any system of electric charges. They are usually construct- 
ed so that the potential difference is the same between any two 
adjacent equipotential surfaces. 

16.3.5 There are two methods for the graphical representation 
of electrostatic fields: by means of lines of force and by means 
of equipotential surfaces. If the arrangement of the lines of 
force of an electrostatic field is known, equipotential surfaces 
can be constructed. Conversely, knowing the arrangement of 
the equipotential surfaces, the magnitude and direction of the 
field strength can be found for each point of the field, i.e. the 
lines of force can be constructed. 


16.4 Conductors in an Electrostatic Field 


16.4.1 Solid metal conductors contain current carriers. These 
are conduction electrons (free electrons) that can be moved 
by the external electric field through the volume of the con- 
ductor. Conduction electrons appear when a conducting metal 
goes over from a less condensed to a more condensed state, 
i.e. from the gaseous to the liquid or solid state. At this valence 
electrons (39.3.9) are collectivized; they are separated from 
“their” atoms and form a special electron gas. 

16.4.2 The electrical properties of conductors under electrc- 
static conditions are determined by the behaviour of the con- 
duction electrons in the external electrostatic feld. In the 
absence of an external electrostatic field, the electric fields of 
the conduction electrons and the positive ions of the metal 
(40.1.3), the atomic cores, mutually compensate each other. 
When a metal conductor is placed into an external electro- 
static field, this field redistributes the conduction electrons 
in the conductor so that at any point within the conductor the 
conduction electrons and the positive ions compensate the 
external electrostatic field. 
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At any point inside a conductor located in an electrostatic 
licld, the strength of the resultant steady-state electric field 
is equal to zero. 

16.4.3 At the surface of the conductor the field strength vec- 
tor E should be normal to the surface. If this were not so the 
tangential component E, of vector E would cause the charges 
lo move along the conductor’s surface, and this would contra- 
dict the fixed distribution of the charges. This leads to the 
following consequences: i 

(a) E = 0 at all points inside the conductor; at all points on 
its surface E = Ep (and E, = 0), where E, is the normal com- 
ponent of the field strength vector; 

(b) the whole volume of a conductor in an electrostatic field 
is equipotential, because, for any point within the conductor, 


av 
dl 


(c) the surface of the conductor is also equipotential (16.3.4), 
because for any line on the surface 


“=E= 0 and V= const; 

(d) in a charged conductor the uncompensated charges are locat- 
ed only on its surface. This follows from the Ostrogradsky- 
Gauss electric flux theorem (15.3.3), according to which the 
total charge g inside a conductor and enclosed within a certain 
volume bounded by the arbitrary closed surface S is equal to 


-=—-Ecosa=0 and V=const;. 


(= De = D dS cosa=0, 
S 


because D = Q at all points on the surface. 

16.4.4 When the electrostatic field is set up by a charged con- 
ductor the displacement and strength of this field close to the 
surface of the conductor are found by the equations 


Dì=0 and EL y= 


(in SI units), 


ofr 

where n is the outward normal to the conductor’s surface, o is 
the surface charge density (15.2.3), ep is the relative permittiv- 
ity (dielectric constant) of the medium (14.2.4) and g is the 
electric constant in SI units (14.2.7). 


16* 
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CHAPTER 17 CAPACITANCE 


17.1 Capacitance of an Isolated Conductor 


17.1.1 Upon an increase in the amount of electricity transmitted 
to an isolated conductor, the charges are distributed over the 
conductor’s surface with varying surface density. The way in 
which the charges are distributed depends only on the shape 
of the conductor, but not on the amount of electricity that was 
previously on the conductor. Each new portion of charges is 
distributed over the surface of the conductor similar to the 
preceding portion. It follows that, at any point on the con- 
ductor’s surface, the surface charge density o (15.2.3) increases 
in proportion to the charge q on the conductor: o = kq, where 
k = f (x, y, z) is a function of the coordinates of the point 
being considered on the surface. 

The potential of the field set up by a charged conductor (16.2.6) 
is 


1 $ ods q È k dS 
V = ———— 20 — = 9 -———_ 
ANE yEr r ANE Er r 


For points on the surface S of the conductor this integral de- 
pends only on the size and shape of the conductor’s surface. 
Hence the potential V of the conductor is proportional to its 
charge q. 

17.1.2 The ratio of the charge q of an isolated conductor to its 
potential V is called the capacitance of the given conductor: 


> ~1 
C = a or C=AnNepey (4 ae) (in SI units). 
S 


The capacitance of an isolated conductor is numerically equal 
to the charge that changes the potential of the conductor by 
one unit. 

The capacitance of an isolated conductor depends upon its 
shape and dimensions. For geometrically similar conductors, 
their capacitances are proportional to their linear dimen- 
sions. 
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The capacitance of an isolated conductor depends upon the 
dielectric properties of the surrounding medium. The capaci- 
tunce of a conductor in a homogeneous isotropic medium is 
proportional to the relative permittivity (dielectric constant) 
(14.2.4) of the medium. 

17.1.3 It follows from the equation for the potential of an 
isolated conducting sphere of radius R (16.2.6) that the capa- 
vilance of such a sphere is 


(== AREER (in SI units), 
(= eR (in cgse units). 


17.2 Mutual Capacitance. Capacitors 


17.2.4 When there are other conductors close to conductor A, 
the capacitance of the latter is greater than for an identical, 
but isolated, conductor. This is due to the fact that when the 
charge q is transferred to con- 
ductor A, charges are induced 
on the conductors surrounding 
i under the influence of the 
field set up by conductor A. 
The portions of the surrounding 
conductors nearest to the induc- 


ing charge q become opposi- 
lely charged (Fig. 17.1). These 
charges weaken the field set up | 


by charge q, lower the poten- 

tial of conductor A and in- 

crease its capacitance. Fie. 17.4 

17.2.2 In a system of two con- oan 

ductors arranged close to each 

other, having charges q equal in magnitude but opposite in 
sign, a potential difference V, — V is produced that is pro- 
portional to q: 


4 


1 
heks d, 


where C = g/(V; — Va) is the mutual capacitance of the two 
conductors, 


} 


246 CAPACITANCE [17 


The mutual capacitance of two conductors is numerically equal 
to the charge that it is necessary to transfer from one conduc- 
tor to the other to change the potential difference between them 
by one unit. 

17.2.3 The mutual capacitance C of two conductors depends 
upon their shapes, dimensions and relative positions. Moreover, 
C depends upon the dielectric properties of the medium surround- 
ing the conductors. If the medium is homogeneous and iso- 
tropic, C is directly proportional to the relative permittivity 
(14.2.4) of the medium. 

When one of the conductors is moved away to infinity, the 
potential difference V, — V, between them increases and their 
mutual capacitance is reduced, tending to the capacitance of 
the remaining isolated conductor. 

17.2.4 A capacitor is a system of two charged conductors, in- 
sulated from each other and having charges of equal magnitude 
and opposite sign, provided that their shapes and relative 
position are such that the conductors set up a concentrated 
electrostatic field in a limited region of space. In the case of 
parallel-plate capacitors, the conductors are called plates. The 
capacitance of a capacitor is the mutual capacitance of its plates 
(or, in the general case, conductors). 

17.2.5 The capacitance of a parallel-plate capacitor, consisting 
of two metal plates, is 


C= Sord (in SI units), 
_ Er i l 
C= Zid (in cgse units), 


where g, is the relative permittivity of the medium filling the 
space between the plates, S is the area of each plate, and d 
is the distance between them. For a multiple-plate capacitor 
consisting of n plates, the formula for the capacitance contains 
S(n — 1) instead of S. The capacitance formula is valid only 
for small d values, when the lack of uniformity of the electro- 
static field at the edges of the capacitor plates can be ignored. 

17.2.6 A spherical capacitor consists of two concentric hollow 
metal spheres A and B, of radii r; and r, (Vig. 17.2). The spheres 
are charged on their surfaces and set up fields only outside 
themselves (15.1.4). Hence, in the space between the spheres 
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the field is set up only by the charge of sphere A. Outside the 
capacitor, the fields of the oppositely charged spheres A and B 
mutually eliminate each other. 

a eapactiance of a spherical capacitor is calculated by the 
ormula 


__ AM gry Po 
Ps — ri 


Y 


(in SI units), 


Errio , 
C= — (in cgse units). 
ro — Tg 


As rg > œ and 1/r, > 0, the inner sphere becomes an isolated 
one and C = 4ne£,ri (cf. 17.1.3). For any finite values of r, 


and ro 


: r 
To — ry 

i.e. the capacitance of a spherical capacitor is greater than that 

of an isolated hollow sphere. 


Fig. 17.2 Fig. 17.3 


17.2.7 A cylindrical capacitor consists of two hollow coaxial 
metal cylinders of height h and radii r, and r, (Fig. 17.3). The 
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formula for the capacitance of a cylindrical capacitor (or of a 
coaxial cable) is of the form 


MDN 2NEoErh 


CSS (in SI units), 
T9 
ln — 
ry 
c= et (in cgse units). 
2 ln —— 
ry 


17.2.8 All types of capacitors are characterized by their break- 
down voltage, which is a potential difference across the plates 
at which an electric discharge (22.5.1) passes through the layer 
of dielectric in the capacitor. The breakdown voltage depends 
upon the properties of the dielectric, its thickness and the 
shape of the plates. 

17.2.9 To obtain a high capacitance capacitors can be connecied 
in parallel. This is done by connecting the plates of like charge 
together. Here the total capacitance C is 


n 
C=) Ca 
i=1 


where C; is the capacitance of the ith capacitor. 

17.2.10 When capacitors are connected in series their oppositely 
charged plates are connected together. The quantities added 
together in this case are reciprocals of the capacitance C; of 
each capacitor: 


i=i 


The resultant capacitance C is always less than the minimum 
capacitance of any capacitor in the bank. Upon a connection 
in series, the possibility of a breakdown (17.2.8) of the capac- 
itors is reduced because the potential difference across cach 
capacitor is only a part of the total potential difference across 
the. whole bank of capacitors, 


18.1] DIPOLE MOMENTS OF DIELECTRIC MOLECULES 249 


CHAPTER 18 DIELECTRICS 
IN AN ELECTRIC FIELD 


18.1 Dipole Moments of Molecules of a Dielectric 


18.1.1 Dielectrics are substances that do not conduct electric 
current. At not especially high temperatures and under con- 
ditions when diclectrics are not subject to very strong electric 
fields, these substances, in contrast to conductors, have no 
carriers of electric current. 

18.1.2 The molecules of a dielectric are electrically neutral 
and contain an equal number of positive and negative charges. 
Nevertheless, the molecules do have electrical properties. As 
a first approximation, a molecule of a dielectric can be regarded 
as a dipole having the electric dipole moment pẹ = ql (15.2.3), 
where q is the magnitude of the total positive (and also total 
negative) charges, located, respectively, at the centres of gravity 
of the charges, l is the distance between the centres of gravity 
of the positive and negative charges. As any dipole, a molecule 
of matter sets up an electric field in the surrounding space 
(15.2.3). F 

18.1.3 A dielectric is said to be nonpolar (a nonpolar dielectric) 
if the electrons of the atoms in its molecules are arranged sym- 
metrically with respect to their nuclei (Hə, Oa, CCl,, etc.). 
In such molecules the centres of gravity of the positive and 
negative charges coincide when there is no external electric field 
[2 = 0 (18.1.2)] and the dipole moment p, of the molecule is 
equal to zero. When a nonpolar dielectric is placed into an 
external electric field, the electron shells (39.3.6) in the atoms 
(or molecules) are deformed and the centres of gravity of the 
positive and negative charges are displaced with respect to 
each other (l -+ 0). At this an induced electric dipole moment 
appears in atoms (or molecules) of the dielectric. This moment 
is proportional to the strength E of the external electric field: 


Pe = € aE (in SI units), 
Pe = &E (in cgse units), 


where a is the polarizability of the molecule (or atom) and £- 
is the electric constant in SI units (14.2.7). 
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The polarizability of a molecule depends only on its volume. 
It is important that a does not depend upon temperature. The 
thermal motion of the molecules in nonpolar dielectrics has no 
effect on the induction of dipole moments in them. Molecules 
with such dipole moments are like quasi-elastic (induced) di- 
poles. 

18.1.4 A polar dielectric is one in which the molecules (or atoms) 
have electrons arranged asymmetrically with respect to their 
nuclei (H,0, HCl, NH;, CH,Cl, etc.). In such molecules the 
centres of gravity of the positive and negative charges do not 
coincide, but are at a practically constant distance I from each 
other. With respect to electrical properties, molecules of polar 
dielectrics are like rigid dipoles that have a permanent dipole 
moment P, = const. 

18.1.5 A permanent dipole placed in a homogeneous external 
electrostatic field is subject to the action of a force couple (4.1.6) 
with the moment of force 
(torque) M equal to 


M = [p,E]. 


The moment M of the force cou- 
pleis perpendicular to the plane 
passing through vectors p, and 
E. From the tip of vector M 
rotation through the smaller 
angle from vectors p, to E is 
seen to be counterclockwise. In 
Fig. 18.4 the moment of force M 
is directed normal to the diagram into the page and tends to 
turn the dipole so that pe and E are parallel to each other. In 
real molecules of polar dielectrics, the molecules are deformed 
in addition to the turning of the dipole axes into alignment 
with the field. This leads to the induction of a certain dipole 
moment (18.1.3). 

18.1.6 When a permanent dipole is placed into a nonhomo- 
geneous electric field, in which the field strength E varies along 
the length Z of the dipole, then, in addition to the torque M 
(18.1.5), the dipole is subject to force F, which is equal to 


ôE 
F= pe “oy 
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or, in the more general form: 


p ôE ôE ôE 
I = Dex ar Pu gy T Pez Oz? 


where Pex: Pey and pe, are the projections of vector pe, on the 
axes of a Cartesian coordinate system. In particular, if vector E 
is directed along the OX axis and depends only on the coor- 
dinate z [Ey = E (z), E} = E, = 0 and E = £,i], then 


ôE 
6x ` 


l = Pex 


By the action of force F a free permanent dipole tends to move 
to a region of highest field strength. 

18.1.7 The potential energy E, of a permanent dipole in an 
external electric field is 


Ep = —p.E = —p,F cos 0, 


where p, is the electric dipole moment, E is the field strength 
at the point where the dipole is located and 0 is the angle be- 
tween the dipole axis and vector E. The minus sign indicates 
that the stable position of the dipole corresponds to the angle 
0 = 0, at which the potential energy of the dipole has its mini- 
inum value and vector p, is directed along vector E. 


18.2 Polarization of Dielectrics 


18.2.1 When a polar dielectric (17.1.4) is not in an external 
electric field, the vectors of its dipole moments are chaotically 
oriented as a result of the random thermal motion of the mole- 
cules. Consequently, the sum of the dipole moments of all the 
molecules is equal to zero in any physically infinitesimal 
volume AV*. 

No dipole moments of the molecules whatsoever can be induced 
(18.1.3) in a nonpolar dielectric that is not in an external 
electric field. 


* Here AV® vg, where vy is the volume of a single molecule, 
and an immense number of molecules are contained in vol- 
ume AFV. 
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18.2.2 When a dielectric is placed in an external electric field, 
it is polarized. This means that in any element of volume AY, 
the total dipole moment of its molecules is no longer equal to 
zero. A dielectric in such a state is said to be polarized (polar- 
ized dielectric). In accordance with the structure of the mole- 
cules (or atoms), distinction is made between the following 
three types of polarization: 

(a) Orientation polarization of polar dielectrics (18.1.4). The 
external electric field tends to Ane the dipole moments of the 
permanent dipoles along the direction of the field (18.1.5). 
This is hindered by the random thermal motion of the molecules 
which tends to disarrange the dipoles arbitrarily. The combined 
action of the field and thermal motion results in preferred orien- 
tation of the electric dipole moments along the field. This 
polarization increases with the strength of the electric field 
and with a drop in temperature. 

(b) Electronic polarization in nonpolar dielectrics (18.1.3) In 
the molecules of dielectrics of this type dipole moments directed 
along the field are induced (18.1.3) by the action of the external 
electric field. The thermal motion of the molecules has no 
effect on electronic polarization. In gaseous and liquid dielec- 
trics, electronic polarization takes place practically simulta- 
neously with orientational polarization. 

(c) Ionic polarization in solid dielectrics having ionic crystal 
lattices (40.1.3), such as NaCl and CsCl. The external electric 
field causes displacement of all the positive ions in the direction 
of the field strength E and all the negative ions in the opposite 
direction. 

18.2.3 A quantitative measure of the polarization of a dielec- 
tric is the polarization vector P,. The polarization vector (or 
simply the polarization) is the ratio of the electric dipole moment 
of a small volume AV of the dielectric to this volume: 


where pe; is the electric dipole moment of the ith molecule, 
and n is the total number of molecules in the volume AV. This 
volume should be so small that the electric field (45.1.2) within 
it can be assumed uniform. At the same time, the number n 
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of molecules in it should be sufficiently large to allow statistical 
methods of investigation (8.2.2) to be applied. 

18.2.4 For a homogeneous nonpolar dielectric (18.1.3) jn a uni- 
form electric field 


Pe = NoPe: 


where n, is the number of molecules in unit volume, and p, 
is the dipole moment of a single molecule. Making use of the 
formula for pe (18.1.3) we obtain 


Pe = NgoEoQE = £€XeE (in SI units), 
Pe = nok = xb (in cgse units), 


where Xe = ma is the dielectric (or electric) susceptibility of the 
substance. 

18.2.5 For a homogeneous polar dielectric (18.1.4) in a uniform 
electric field, Pe = no (pe), Where (pẹ) is the average value 
of the component along the field of the permanent dipole 
moment of the molecule. When a polar dielectric is in a weak 
external electric field, the dielectric susceptibility is calculated 
by the Debye-Langevin formula: 


= nop 
Xe = 3EokT 
2 

e 


Xe = “SET 


Here k is Boltzmann’s constant (8.4.5), and T is the absolute 
temperature. The remaining notation is given in Sect. 18.1.3. 
Shown in Fig. 18.2 is the dependence Xe = Xe (1/T) for non- 
polar (a) and for polar (b) molecules. The straight line (b) does 
not pass through the origin of coordinates because orientational 
and electronic polarization (18.2.2b) usually occur in polar 
molecules and the dielectric susceptibility consists of two parts: 
Xe = Xe + Xe Where Xe and xg are expressed by the equations 
in Sects. 18.2.4 and 18.2.5. 

48.2.6 If a dielectric is in a uniform electric field, the electrical 
neutrality of any volume AV of the dielectric, containing a suf- 
ficiently large number of molecules, is provided for by the mutual 


(in SI units), 


(in cgse units). 
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compensation of the dipole charges of opposite sign alongside 
one another (Fig. 18.3). In the thin layers at surfaces S, and S3 
of the dielectric, bounding its volume, surface-bound polarization 
charges appear as a result of the polarization of the dielectric. 
At the surface where the lines of force (15.1.6) of the external 
electric field enter the dielectric, negative charges of the ends 


A 
b 
a 
0 tL 
7 
Fig. 18.2 Fig. 18.3 


of the dipole molecules appear. Positive charges appear at the 
opposite surface. The surlace density (15.2.3) Op of the polar- 
ization charges is calculated by the formula 


Op = Pens 


where P en is the projection of vector P, on the outward normal 
to the surface of the dielectric. 

In a nonuniform electric field, the polarization of a dielectric 
is also nonuniform and the polarization vector P, depends upon 
the coordinates. Hence, in addition to surface-bound charges, 
space polarization charges also appear. These are distributed 
with the volume charge density (15.2.3) pp: 


Pp = —div Pa 


where div Pe= re | Se ee is the divergence of 


the polarization vector. 
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18.3 Relation Between Displacement, Field Strength 
and Polarization Vectors 


18.3.1 Distinction is made in substances between two kinds of 
charges: free and bound. 

Bound charges are those included in the composition of atoms 
and molecules, as well as the charges of ions in crystalline solids 
with an ionic lattice (40.1.3). Charges not bound to the afore- 
mentioned particles of matter are said to be free. Free charges 
include: (a) current-conducting charges in conducting media 
[conduction electrons in metals and semiconductors (16.4.1), 
ee in semiconductors (41.10.3), ions in electrolytes and 
gases], 

(b) surplus charges that are transmitted to a body by various 
means and violate its electrical neutrality. An example are 
charges imparted from outside onto the surface of a dielectric. 
18.3.2 Generally, an electric field is set up in a dielectric by 
both free and bound charges. The field strength vector E charac- 
terizes the resultant field in the dielectric, set up by the two 
kinds of charges, and depends upon its electrical properties: 
the relative permittivity (dielectric constant) ¢, (14.2.4). But 
the primary source’of an electric field in a dielectric is the free 
charges. As a matter of fact, a field of bound charges is set up 
in a dielectric as a result of its polarization when it is placed 
in an external electric field set up by free electric charges. 
18.3.3 The information contained in Sects. 18.3.1 and 18.3.2 
call for a certain revision of the See a theorem 


(15.3.3). The algebraic sum of the charges > q; on the right- 
i=1 
hand side of the theorem is to be regarded as the algebraic 


sum of the free electric charges enclosed by surface S, i.e. 
k 


>) % = tree, and the theorem is then of the form 
ist 


S p as= Dy dS = qfree (in SI units), 
5 S 


$ D as= Dyn dS = 4ng free (in cgse units). 
S S 
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In this form the Ostrogradsky-Gauss theorem is valid for an 
electric field either in homogeneous and isotropic or in nonho- 
mogeneous and anisotropic media (cf. 15.3.3). 

18.3.4 For an electric field in vacuum 


D = £E (in SI units), 
D=E (in cgse units). 


The flux of the field strength vector E through an arbitrary closed 
surface S in vacuum is 


) cob n dS =Qtree (in SI units), 
S 


$ En dS = 4ngfree (in cgse units). 
S 


In accordance with Sect. 18.3.2, the flux of vector E for a field’ 
in a substance is 


È EEn dS = treet 4pna (in SI units), 
S 


È En dS = 4n (drree+ dona) (in cgse units). 
S 


The sum of bound charges gpnq within closed surface S is cal- 
culated by the equation 


dpnd = -0 Pen aS, 
S 


where Pen is the projection of the polarization vector on the 
outward normal to surface dS. Hence 


(cont Pen) dS = arree (in SI units), 
S 


Ç (En-H4nPen) dS = 4ni free (in cgse units). 
5 


Ini] FERROELECTRIC MATERIALS 257 


\ correlation with the general formulation of the Ostrogradsky- 
Gauss theorem (18.3.3) enables a relation to be established be- 
(ween vectors D, E and P,: 

l)a  EoEn + Pen (in SI units), 

Da En + 4nPe, (in cgse units), 


or, by virtue of the arbitrary nature of outward normal n, 
I) -= e,E + Pe (in SI units), 
D E- 4nP, (in cgse units). 


These formulas are generalizations of the formulas in Sect. 15.3.1. 
Making use of the results obtained in Sect. 18.2.3, we have 
lor an isotropic homogeneous medium 


D £E + EoX%eE = & (1 -+ Xe) E (in SI units), 
Ib = E+ 4nye9E = (1+ 4nye) E (in cgse units). 
Consequently 

D = e,¢,E, where e, = 1-++ xe (in SI units), 

D = e, E, where e = 1-+ 4n%e (in cgse units). 


The quantity e, is the relative permittivity (dielectric con- 
stant) (cf. 14.2.4). 

lor vacuum €, = 1 and Xe = 0. 

The concept of'e, introduced in Sect. 14.2.4 has a meaning 
only for isotropic homogeneous media. 


18.4 Ferroelectric Materials 


18.4.1 Ferroelectrics belong to a group of crystalline dielectrics. 
The first substance of this type to be investigated was Seignette’s 
salt (potassium sodium tartrate) NaKC,H,O,-4H,O and, con- 
sequently, they are sometimes called seignette-electrics. An- 
other example of, a ferroelectric is barium titanate BaTiO}. 
Typical of ferroelectrics is a drastic increase in the relative 
permittivity (dielectric constant) (14.2.4) in a definite tem- 
perature range (Fig. 18.4). 

18.4.2 The relative permittivity ¢, and the dielectric suscepti- 
bility ye (18.2.4) of ferroelectrics are functions of the field strength 


1 7-—0186 
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E in the substance (Fig. 18.5). As a result, no linear relation 
between vectors P, and E is observed in these substances. The 
dependence of the electric displacement D on the field strength 


40 60 80 100 120 Moc t 0 i T 
Fig. 18.4 gee? 
D 
9 
Fig. 18.6 


is of a complex nature and a linear relation between D and E 
exists only at very high E values (Fig. 18.6). 

18.4.3 A monocrystal of a ferroelectric is divided into spontane- 
ously polarized regions, called domains (cf. 26.5.4). This spon- 
taneous polarization of the domains is a result of the orientation 
of the dipole moments of the molecules in the domain in a defi- 
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nite direction. In the absence of an external electric field, the 
polarization vectors of the various domains are directed at ran- 
dom and for a large monocrystal or a polycrystal the total 
polarization is equal, on an average, to zero. When a ferro- 
electric is placed in an electric field, the electric moments of 
ihe domains are reoriented and the crystal acquires a total 
polarization not equal to zero. 

18.4.4 Domains are formed in ferroelectrics within a definite 
temperature range, between the upper and lower Curie points 


OP and cree Coe 26.5.2). For Seignette’s salt (also called Rochelle 


salt) (18.4.1), O2P = 298 K and O/©¥ = 258 K. At T > OUP 


the forces of interaction between the dipoles cannot resist the 
thermal motion, the spontaneous polarization of the domains 
is violated and the ferroelectric is converted into an ordinary 
polar dielectric. The drastic increase in heat capacity of the 
substance indicates that a second-order phase transition (12.4.2) 
occurs at the Curie point. A disordered phase exists above the 


Curie point 6G? and the dielectric is not polarized in the absence 


of an external field. At T < 0¢? there is an ordered phase indi- 


cated by the spontaneous po- 
larization in the domains*. 
18.4.5 The phenomenon of di- 
electric, or ferromagnetic, hystere- 
sis (lag) is observed in ferro- 
electrics. As is evident from 
‘ig. 18.7, with an increase in the 
strength of the external field, 
the magnitude of vector P, 
increases and pra satura- 
tion at point a. Then, as F is . 

aces zero, a ferroelectric Fig. 18.7 

has remanent polarization, char- 

acterized by the value of the polarization vector Pg. 
Polarization completely disappears only after applying a depo- 
larization electric field of the strength —E, in the opposite 


* A discussion of the phenomena that occur in ferroelectrics at 
C= glow and lead to the destruction of the domains is beyond 
the scope of the present handbook. 


{7* 
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direction. This is called the coercive force. Periodic reversal of 
polarization of a ferromagnetic material is accompanied by 
the consumption of electric energy, spent in heating the sub- 
stance. The area of the hysteresis loop is proportional to the 
electric energy that is converted into internal energy per unit 
volume of the ferroelectric per cycle. 


CHAPTER 19 ENERGY OF AN ELECTRIC FIELD 


19.1 Energy of a Charged Conductor and an Electric 
Field * 


19.1.1 The charging of an electric conductor requires the per- 
formance of work that must be done in overcoming the Coulomb 
forces of repulsion between like charges. This work increases 
the electrical energy of a charged conductor in the same sense 
as the potential energy in mechanics (3.3.1). The work 6W done 
in bringing a charge dq from infinity to the conductor is 


ôW = V dq = CV dV, 


where C and V are the capacitance and potential of the conduc- 


tor. 
The work required to charge a conductor from zero potential 
to the potential V is 


V 
2 
w= f evar- 
0 


Correspondingly, the energy of a charged isolated conductor 
(intrinsic energy of a charged conductor) is 


* Tt is assumed in this chapter that the electric charges and 
charged bodies are in an isotropic, homogeneous, nonferromagne- 
tic (18.4.1) medium. 
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The energy of a charged capacitor is 


p — LEAVY _ P. 41 
De OE SD 
where C and q are the capacitance and charge of the capacitor, 
and AV is the potential difference between oppositely charged 
plates of the capacitor. 
19.1.2 The energy of a system of charges, distributed continu- 
ously throughout the volume of a dielectric or vacuum, and 
along the surfaces of charged conductors and an electrified dielec- 
(ric, is 


E=— \ Vo as +> \ Vo dv 
char Vehar 


where o and p are the surface and volume density of free charges, 
V is the potential of the resultant field of all the surface and 
volume charges at the points of the elements dS or dv of the 
charged surface or volume (to avoid confusion, volume is denot- 
ed here by the lower-case v). Integration is carried out over all 
charged surfaces Sepa, and throughout all charged volumes 
Vehar. Lhe effect of the dielectric is that with a constant dis- 
tribution of the free charges the values of V differ for the differ- 
ent dielectrics. Thus, in a homogeneous isotropic dielectric 
that fills the whole field, V is ¢,-fold less than in vacuum. 
19.1.3 The intrinsic energy of a charged conductor is, at the 
same time, the energy of its electrostatic field. Hence, for the 
uniform electrostatic field (45.1.3) of a parallel-plate capacitor 
(17.2.5) we have 


a ue 2 seus 9 
AERE A EE L E E E 
2 2 2 
> _ Erk? ; 
Ke= ga Y (in cgse units), 


where v = Sd is the volume of the electrostatic field between 
the capacitor plates. The energy of the field is proportional 
to its volume, and the energy per unit volume of the electro- 
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static field, called the volume energy density EC, is the same 
at all points of a uniform field: 


> 2 r 
ga = He SO — (in SI units), 
d_ Ee ek ED ; 
Ea = e ae (in cgse units). 


19.1.4 In the case of nonuniform electrostatic fields, set up by 
arbitrary charged bodies, the volume energy density at each 
point of an isotropic medium is expressed by the formulas in 
the preceding subsection. But when the medium is electrically 
anisotropic, the volume energy density of the electric field is 


gi — -5 DE (in SI units), 


d 1 : : 
Ei Bn E (in cgse units) 
19.1.5 The energy d#, of an infinitesimal volume of an arbitrary 
electrostatic field in an isotropic medium is 


EErEE? 


dEe= Ef dv -= dv (in SI units). 


The total energy FE, of an electrostatic field is 


2 
Ee = \ Sort a 


vfield 


where integration is carried out throughout the whole volume 
Vaejg Of the field. 

19.1.6 The total energy of an electrostatic field set up by an 
arbitrary charged body is equal to the intrinsic energy of the 
body (19.1.4): 


172 „E2 
m 7 fetch” ae, 


bad 


veld 
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This equation can be generalized for the case of an electrostatic 
Held set up by an arbitrary system of charges. The total energy 
of such a system (19.1.2) coincides with the total energy of the 
electrostatic field set up by this system of charges: 


19.2 Energy of a Polarized Dielectric 


19.2.4 The process of polarizing a homogeneous isotropic dielec- 
(ric, placed in an external electric field; is accompanied by 
work done in deforming the electron shells in the atoms and 
molecules or in turning the dipole axes of the molecules into 
alignment with the field strength. Therefore, a polarized dielec- 
tric has a margin of electric energy. 

19.2.2 The volume energy density of a polarized dielectric at 
a point with the field strength E equals: 


in SI units). 


d __ 9 (&r — 1) 7 
£9 ator) = oe | 


The volume energy density of a field with the same strength E 
in vacuum is 


The volume energy density of a field in a dielectric is ks = 


€ y€,£2/2, and according to the energy conservation law it 
equals 


d _ pd a på 
Ie = E e(vac) | É edic 
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CHAPTER 20 DIRECT ELECTRIC CURRENT 


20.1 Concept of an Electric Current 


20.1.1 Electrodynamics is the main branch of the science of 
electricity that deals with the phenomena and processes invol- 
ved in the motion of electric charges or macroscopic charged 
bodies. The most vital concept in electrodynamics is that of 
electric current. 

20.1.2 An electric current is any ordered motion of electric charges. 
An electric current produced in conductive media as the 
result of the ordered motion of free charges by the action of an 
electric field is called conduction current. Examples of conduction 
currents are the currents in metals and semiconductors, asso- 
ciated with the ordered motion of “free” electrons, and currents 
in electrolytes, which are the ordered displacement of ions of 
opposite signs. 

A convection current is the ordered motion in space of charged 
macroscopic bodies. An example of such a current is that asso- 
ciated with the motion of the earth, having a surplus negative 
charge, along its orbit. 

20.1.3 Upon ordered motion of electric charges in a conductor, 
the equilibrium distribution of charges is violated and the sur- 
face of the conductor is no longer an equipotential surface 
(16.4.3). On the surface of the conductor there is a tangential 
component of the field strength (Eq = 0) and there should be 
an electric field inside the conductor (cf. Sect. 16.4.3). The elec- 
tric current continues until all the points of the conductor 
become equipotential. 

20.1.4 The conditions required for the initiation and existence 
of an electric current in a conducting medium are as follows: 

(a) The availability of free current carriers—charged particles— 
in the medium, capable of ordered motion in it. In metals and 
semiconductors suitable particles are the conduction electrons 
(16.4.1); in liquid conductors (electrolytes) they are the positive 
and negative ions; in gases they are the oppositely charged ions 
and electrons. 

(b) The existence in the given medium of an external electric 
field whose energy is spent in providing ordered motion of the 
electric charges. To maintain an electric current, the energy 
of the electric field must be continuously replenished, i.e. 
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a source of electric energy is required. This is a device that con- 
verts some kind of energy into the energy of an electric field. 
20.1.5 The direction of ordered motion of the positive electric 
t w is conventionally taken as the direction of the electric 
current. As a matter of fact, in metallic conductors the current 
consists of the ordered motion of electrons, which move in 
the direction opposite to that of the current. 


20.2 Current and Current Density 


“0.2.1 The current strength (or simply the current) J is a scalar 
quantity, the rate of flow of electricity. It is equal to the ratio 
of the charge dq, conveyed through the surlace* being considered 
during a small interval of time, to the amount of this time dt: 


dq 
odt’ 
An electric current is said to be direct (direci electric current) 


if its magnitude and direction remain constant with time. For 
direct current 


D. 
p @ ee 
where q is the electric charge conveyed through the surface* 
being considered during the finite time interval from 0 to t. 
20.2.2 If the current is a direct one, the charges cannot accu- 
mulate or be depleted at any part of the conductor. A direct 
current circuit should be closed and comply with the condition 
Vy, = Qs, where Qs, is the total electric charge passing in 
unit time through surface S, into the volume of the conductor 
enclosed between the cross sections S; and S,, and Qg, is the 


total electric charge emerging from this volume in unit time 
through surface So. 

2().2.3 The current density determines the current direction at 
various points of the surface being considered and the distri- 
bution of the current over this surface. The current density 


t In the case of conduction current, through the cross section 
of the conductor, 
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vector į is opposite in direction to the motion of the electrons 
that carry the current in metals* and is numerically equal 
to the ratio of the current d/J through an element of surface 
normal to the motion of the charged particles to the quantity 
dS’, which is the area of this element of surface: 


dI 
Er 
A more general relation between the current density j and an 
element of current d/ is 
dl = į d$, 


where dS = n dS is the vector of an element of surface, and 
n is the unit vector of the normal to surface dS and makes angle 
a with vector j. 

20.2.4 The current through an arbitrary surface S is 


r= | jas =\ jnas, 
S S 


where jp = j cos & is the projection of vector j on the direction 
of normal n (20.2.3), and integration is carried out over the 
whole area of surface S. If the conduction current is being cal- 
culated on the basis of the cross section of the conductor for 
which jp = j, then 


r= \ jas. 
S 


20.2.5 The density of direct current is the same over the whole 
cross section S of a homogeneous conductor. Here 


Bees: 
In a direct current circuit, the current density of two cross 


sections S, and S, is inversely proportional to the cross-sectional 
areas: 


jp Sy 


* In other conducting media vector j coincides in direction with 
that of the positively charged current carriers, 
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20.3 Fundamentals of the Classical Electron Theory 
of Electrical Conduction in Metals 


20.3.4 The high electrical conduction of metals is due to the 
fact that metals contain an immense number of current carriers, 
the conduction electrons, which are formed of the valence electrons 
(39.3.9) of the atoms of metal. These electrons do not belong 
to any definite atom, but are collective electrons. According to 
the classical Drude-Lorentz electron theory, these electrons are 
regarded as an electron gas (16.4.1) that possesses all the prop- 
erties of a monatomic ideal gas (8.4.1). 

The number of conduction electrons in unit volume of a mono- 
valent metal is 


2 A 
no = p, 
Ma 


where Na is Avogadro’s number (Appendix II), m, is the 
atomic mass of the metal and ọpọ is its density. In order of mag- 
nitude nọ ~ (1028 to 107°) m>. 

In the absence of an electric field inside the metal the conduction 
electrons move at random and collide with the ions of the crystal 
lattice of the metal (40.1.1). It is assumed that the mean free 
path (A) of the electron (10.5.1) should be of the order of mag- 
nitude of crystal lattice constants of the metal, i.e. (A) x . 
10e m; i 

The average kinetic energy of thermal motion of the electrons 
(10.2.4) is 


mv? 3 
rms 
{=F kT, 


where m is the mass and vns is the root-mean-square velocity 
of the electrons (10.2.3). At the temperature T = 273 K, the 
velocity vms Z% 10? m/s. 

The arithmetic mean velocity (uw) of thermal motion of the 
electrons (10.3.6) is of the same order of magnitude. 

20.3.2 Electric current in a metal is initiated by the action of 
the external electric field (20.1.4), which leads to ordered motion 
of the electrons. The current density j is equal to the charge 
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of all the electrons that pass in unit time through unit area of 
the conductor’s cross section, 


j = Nge (v), 


where n, is the number of conduction electrons per unit volume, 
e is the magnitude of the charge of the electron, and (v) is the 
average velocity of ordered motion of the electrons due to the 
action of the external field. At the highest current densities 
(v) is 10-* m/s and is negligibly small compared to the thermal 
velocities of the electrons (20.3.1). 

20.3.3 A steady electric current is established in a circuit in 
the time t = L/c, where LZ is the length of the circuit and c is 
the velocity of light in vacuum. The time ¢ coincides with the 
time required to establish a steady-state electric field along 
the circuit and the initiation of ordered motion of the electrons 
along the whole circuit. Therefore, electric current is started 
practically simultaneously with the closing of the circuit. 
20.3.4 Ohm’s law for current density (Ohm’s law in differential 
form) is 


E 
p 


The current density in a conductor is the product of the electrical 
conductivity x by the electric field strength E. The quantity 
ọ = 1/x is called the electrical resistivity. 

According to the classical electron theory, electrical conductiv- 
ity and resistivity are calculated by the formulas 


where n, is the number of electrons in unit volume of the metal, 
(A) is the mean [ree path of the electron (10.5.1), (w) is the 
arithmetic mean velocity of thermal motion of the electrons 
(10.3.6), and e and m are the magnitude of the charge of the 
electron and its mass. 

20.3.5 By the action of the field the electron acquires a velocity 
equal to may at the end of its mean free path. Upon collision 
with an ion at this point, the electron loses its velocity and 
the energy of ordered motion of the electron is converted into 
internal energy of the conductor, which is heated as the current 
passes through. 
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The volume density EC of the heating capacity of a current is the 
amount of energy generated in unit volume of a conductor in 
unit time. The Joule-Lenz law for the volume density of the 
healing capacity of a current is 


Il — jE= xB? = > E?. 


The Joule-Lenz law in its differential form states: the volume 
density of the heating capacity of a current is equal to the 
scalar product of the current density vector by the electric field 
strongth vector. 

The volume density of the heating capacity of a current is inde- 
pendent of the kind of collisions of the electrons with the crystal 
lattice points [elastic or inelastic collision (3.5.3)]. It follows 
trom the laws of conservation of energy and momentum that the 
energy, AE transmitted to the ion upon a collision of the elec- 
(ron with an ion, is only a small part of the electron’s energy £ 4}. 
In an inelastic collision AE/E,; = m/(m -+ M), whereas in 
an elastic collision AE/E,.; = 4mM/(m + M)?, where m is 
lhe mass of the electron and M is the mass of the ion. Practi- 
cally, in either case, AE/E a x m/M x 10-4. 

20.3.6 The Wiedemann-Franz law states that for all metals the 
ratio of the thermal conductivity K (10.8.5) to the electrical 
conductivity x is directly proportional to the absolute tem- 
perature T: 


K k \? 
xo ( e ] 
where k is Boltzmann’s constant (8.4.5) and e is the charge of 
the electron. 

20.3.7 The classical electron theory of electrical conduction 
of metals has the following shortcomings: 

(a) it cannot explain the experimentally observed linear rela- 
lion, in a wide temperature range, between the resistivity ọ 
and the absolute temperature: ọ & T; 

(b) it yields an incorrect value for the molar heat capacity of 
metals. This should be equal, according to this theory, to 
9 cal/mol-K and be the sum of the heat capacity of the ionic 
crystal lattice (6 cal/mol-K) and the heat capacity of the mon- 
atomic electron gas (3 cal/mol-K). It is known, however, from 


F; 
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the experimentally established Dulong and Petit law (41.7.2) 
that the molar heat capacity of metals differs only slightly from 
that of other solids and is approximately 6 cal/mol-K. It is. 
impossible on the basis of classical theory to explain the absence 
of an electron component of heat capacity of metals; 

(c) the experimental value of the resistivity p and the theoretical 
value of the arithmetic mean velocity (uw) of the electrons lead, 
in formulas of Sect. 20.3.4, to a value of the mean free path 
(A) of the electron that exceeds the crystal lattice constant of 
the metal by two orders of magnitude. This contradicts the 
assumptions of the classical electron theory of the electrical 
conduction of metals. 


CHAPTER 21 DIRECT CURRENT LAWS 


21.1 Extraneous Forces 


21.1.1 A metallic conductor has an electrostatic field that is 
set up by the electrons and positive ions of the crystal lattice. 
This is a field of Coulomb forces (14.2.2). Coulomb interaction 
between the charges in a metal leads to such an equilibrium 
distribution of charges that the electric field inside the conductor 
equals zero and the whole conductor is equipotential (16.4.3). 
Consequently, an electrostatic Coulomb field cannot be the 
cause of the steady-state process of ordered motion of electrons, 
i.e. it cannot generate and maintain a direct (constant) current. 
21.1.2 Eztraneous forces are nonelectrostatic forces whose 
action exerted on the conduction electrons in the conductor 
initiates their ordered motion and maintains a direct (constant) 
electric current in the circuit. In contrast to Coulomb forces, 
extraneous forces do not join unlike charges; they lead to their 
separation and maintain a potential difference at the ends of 
the conductor. 

Extrancous forces are the cause of the nonelectrostatic electric 
field, existing in the conductor and providing for ordered motion 
of the charges from points with a higher potential to points 
with a lower potential. A steady-state electric field of extraneous 
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forces is set up by sources of electrical energy (galvanic cells, 
electric generators, ctc.). 

21.1.3 In a closed hydraulic system that provides for constant 
circulation of a liquid (Fig. 21.1), the liquid travels from point A 
to point B, in the direction opposite 
to the pull of gravity, by the action 
of certain “extraneous forces” exerted 
on the liquid by pump P. The pump 
produces a constant difference in 
hydrostatic pressure between points 
B and A, and the liquid flows by 
gravity from B to A. A source of 
electrical energy plays a similar role 
in a direct current circuit. Owing to 
the electric field set up by extraneous 
forces and existing inside the source, | : 
the electric charges move within the eee: eee 
source against the forces of the elec- gh 
trostatic field. This maintains a po- 
tential difference at the ends of the 
external circuit that is required for 
a direct current to flow. At the ex- 
pense of the energy consumed in the Fig. 21.4 

source, work is done to obtain or- eee 

dered motion of the electric charges. 

In a direct current generator, for example, the work of the 
extraneous forces (21.1.2) is done at the expense of the 
mechanical energy required to drive the rotor of the generator. 


B 


21.2 Ohm’s Law and the Joule-Lenz Law 


21.2.1 At any point inside a portion of a conductor that con- 
tains a source of electrical energy there is an electrostatic field 
of Coulomb forces with the strength Ecoul and an electric field 
of extraneous forces with the strength Eextr- 

According to the principle of superposition of fields (15.2.2) 
the strength of the resultant field is 


E = Ecoul T Eextr 
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Ohm’s law for current density (20.3.4), 


1 
j= p (Ecoul + Eextr), 


enables the following relation to be obtained for portion 41-2 
of a homogeneous conductor of cross-sectional area S: 


2 i 2 2 
I | T= | Ecoul a+ | Eextr dl, 
1 1 1 


where J is the current in the conductor, dl is a vector with the 
magnitude dl, equal to an element of conductor length, and is 
directed tangent to the conductor toward the current density 
vector, and S is the a konal area of the conductor. 


21.2.2 The integral | Ecour dl is numerically equal to the 


1 
work done by the Coulomb forces in moving a unit positive 
charge from point 1 to point 2. According to Sect. 16.3.4 


2 


) Ecoul dl = Vy TF Vo, 
1 


where V, and V, are the potentials at points 1 and 2 of the con- 
ductor. 

The electromotive force (emf) €12, acting over the portion 1-2 
of the circuit, is the name given to the linear integral 


2 


612 To \ Eextr dl. 
1 


C2 


The electromotive force @;, is numerically equal to the work 
done by the extraneous forces in moving a unit positive charge 
along the conductor from point 1 to point 2. This work is done 
at the expense of the energy consumed in the source. 

The voltage (or voltage drop) Uy, across the portion 1-2 is the 
physical quantity numerically equal to the work done by the 
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resultant field of Coulomb and extraneous forces in moving 
a unit positive charge along the circuit from point 1 to point 2. 
Thus 


2 2 
a | (Ecoul + Eextr) dl = \ E dl, 
1 4 


or 
(yy = (Vi ae Vo) ZE E12 


The voltage across the ends of the circuit coincides with the 
potential difference only when no emf is applied to this por- 
Lion. . 

The integral 


f dl 
lia Pi | S 
1 


is the resistance Ry. in the portion of the circuit between sec- 
tions 1 and 2. For ahomogeneous cylindrical conductor (wire), 
p = const, S = const 

bie 
Rio =p Ka , 
where lia is the length of the conductor between cross sections 4 
and 2. 
21.2.3 The generalized Ohm’s law for an arbitrary portion of 
a circuit is 


[Ry = Ukr = (Vy aa Vo) ++ Gye. 


The product of the current by the resistance of a portion of the 
circuit is equal to the potential difference across this portion 
plus the emf of all sources of electrical energy connected in the 
given portion of the circuit. In this form Ohm’s law is applicable 
both for passive portions of the circuit, not containing sources 
of electrical energy, as well as for active portions that contain 
such sources. 

21.2.4 The sign rule for the emf of sources of electrical energy 
included in portion 1-2 states that if the current inside the 


18—0186 
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source is from the cathode to the anode, i.e. the strength of the 
field of extraneous forces coincides in direction with the current 
in this portion of the circuit, then, in calculations, the emf 812 
of this source is considered positive. If the current inside the 
source is from the anode to the cathode, the emf €,, of this 
source is considered negative (Fig. 21. 2), 

21.2.5 In a closed series electric circuit, the current is the same 
at all cross sections, and such a circuit is a portion with coin- 


7 >00 r Ê <0 2 
a Ho 
(a) (6) 

Fig. 21.2 


ciding ends (points 1 and 2 coincide). In this circuit V,; = V, 
and Rig = == R, the total resistance of the whole circuit. 
Ohm’s law for a closed electric circuit is 


IR=6, 

where @ is the algebraic sum of all the emf’s applied in the 
circuit. 

If a closed circuit consists of a source of electrical energy with 
an emf € and internal resistance r, and the resistance of the 
external part of the circuit equals R, Ohm’s law is of the form 


C7 


R+r ’ 


The potential difference across the terminals of the source is 
equal to the voltage across the external part of the circuit: 


V,—V,=RI=6§ —Ir. 


T = 


If the circuit is open and has no current (J = 0), the potential 
difference across the terminals of the source is equal to its emf: 


7 say CS 
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A voltmeter connected in parallel with portion 1-2 of a direct 
current (d.c.) electric circuit measures the potential difference 
across the ends of this portion, and not the voltage: 


byly — Vi a Vo, 


where Ry and Jy are the resistance of the voltmeter and its 
current (Fig. 21.3). This follows from the generalized Ohm’s 
law (21.2.3) as written for portion 1-2 of the voltmeter circuit, 
in which there is no emf (see also Sect. 21.3.5). 

21.2.6 In a d.c. electric circuit with fixed conductors, the work 
done by the extraneous forces is completely consumed in heating 
the conductors (20.3.5). The energy £, evolved ina circuit during 
the time ż throughout the whole 

volume of the conductor, is 7 
equal to 


k= 1TUt, 


where 7 is the current and U is 
the voltage. The quantity of 
heat Q (9.2.1) in ĉalories, cor- &! 2 
responding to this energy and 3 
evolved in the conductor, is Fig. 21.3 


() == 0.24 IUt. , 


All the other quantities are expressed in SI units (Appendix [). 
The Joule-Lenz law states that the quantity of heat generated 
by current in a conductor is proportional to the current, voltage 
and time that the current flows. Other expressions of the Joule- 
Lenz law are: . 

U2 


poas 2 l = ° ~AT Ue 
Q=0.24PRi=0.24—— t 


21.3 Kirchhoff’s Laws 


21.3.4 The analysis of complex (branched) circuits consists in 
finding the currents in the various portions of the circuit in 
accordance with their given resistances and applied emf’s. 


18* 


276 DIRECT CURRENT LAWS [21 


21.3.2 A branch point, or junction, in a branched circuit, or 
network, is a point where there are more than two possible 
directions of the current (Fig. 21.4). . 
Kirchhoff’s first law (point rule) states that the algebraic sum 
of the currents that mect at any junction of a network equals 
Zero: 

i=n 

N Ty==0, 


i—i 


where n is the number of conductors joined at the branch point 
and J; is the current at the point. Currents flowing toward the 
branch point are considered posi- 
tive; those flowing away are con- 
sidered negative. 

Kirchhof’ s second law (mesh rule) 
states that in any closed loop 
(mesh of a network), arbitrarily 
chosen in a branched circuit (net- 
work), the algebraic sum of the 
products of the currents J; by the 
resistances R; of the corresponding 
portions of the loop is equal to 
the algebraic sum of the emf’s in 
the loop. Thus 


z; i=n1 i=n1 
Fig. 241.4 5 pr 5 A 
ìi =1 i=1 


where n, is the number of portions into which the loop is divided 
by branch points (junctions). Inapplying the mesh rule, a definite 
direction around the loop is chosen (either clockwise or counter- 
clockwise). Currents coinciding in direction with the chosen 
one are considered positive. The emf’s of the sources of electrical 
energy are considered positive if they produce currents whose 
direction coincides with the chosen one. 

21.3.3 The following procedure is employed to analyse a complex 
(branched) d.c. circuit: 

(a) The directions of the currents are arbitrarily assigned in all 
the portions of the circuit. 

(b) For m branch points (junctions), m — 1 independent equa- 
tions are written on the basis of Kirchhoff’s first law. 
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(°) Arbitrary closed loops are separated out and, after choosing 
the direction for going around cach loop, a system of equations 
is written on the basis of Kirchhoff’s second law. In a branched 
circuit consisting of p portions (branches) between adjacent 
branch points and m branch points (junctions), the number of 
independent equations based on the mesh rule equals p — m + 
' {. In selecting the loop, it is necessary for each new loop to 
contain at least one portion (branch) not included in the pre- 
viously considered loops. 

21.3.4 To shunt an ammeter means to connect an additional 
resistor Rsp in parallel to the ammeter. This is done so that 


Nie. 21.5 Fig. 24.6 


the ammeter, having the resistance Ry and designed to measure 
a maximum current of Ij, can be applied to measure currents J 
that exceed current J, (fig. 21.5). The required resistance of 
(he shunt is found according to Kirchhoff’s laws: J = Io + Ish 
and ZoRo = Isnitsn, from which Isp is eliminated: 


Toho 
lsh 


Rsh = 


21.3.5 A potential difference V, — V, = U across a portion 
of a circuit is to be measured by a voltmeter designed for a maxi- 
mum measurement of Uy volts (21.2.5) at a maximum permis- 
sible current of Z (with Uo = /,/?,) and this potential differ- 
ence exceeds Uy (i.e. U > U,). This can be done by connecting 
an additional resistor R, in series with the voltmeter (Fig. 21.6). 
The required additional resistance is determined from the equa- 
lion U = (Rg -+ Ry) I), from which 


Ra == io Ro. 
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CHAPTER 22 ELECTRIC CURRENT IN LIQUIDS 
AND GASES 


22.1 Faraday’s Laws of Electrolysis. 
Electrolytic Dissociation 


22.1.1 Liquids are conductors of electric current (electrolytes, 
ion conductors) provided that ordered motion of ions can be 
a eres in them by the action of an external electric 
fields. 

The ordered motion of ions in conducting liquids takes place 
in an electric field set up by electrodes, which are conductors: 
connected to the terminals of a source of electrical energy. The. 
positive electrode is called the anode, the negative electrode, 
the cathode. Positive ions, called cations, are the ions of metals 
and hydrogen, and move toward the cathode. Negative ions, 
called anions, are the ions of acid radicals and of the hydroxyl 
group, and move toward the anode. Electric current in electro- 
lytes is accompanied by the phenomenon of electrolysis. This 
is the deposition on the electrodes, or the liberation at them, 
of the components of the substances contained in the electrolyte 
or of other substances produced by secondary reactions. 

22.1.2 Faraday’s first law of electrolysis states that the mass M 
of a substance deposited on or liberated at an electrode is di- 
rectly proportional to the electric charge Q passing through the 
electrolyte. Thus 


M = kQ = kit 


if direct current J is passed during the time ¢ through the elece- 
trolyte. 

The proportionality factor k is called the electrochemical equiv- 
alent of the substance. It is numerically equal to the mass of 
the substance liberated when a unit electric charge passes 
through the electrolyte and depends upon the chemical nature 
of the substance. 

22.1.3 Faraday’s second law of electrolysis states that the elec- 
trochemical equivalents of the elements are directly proportional 
to their chemical equivalents: 


c= Ckeh, 
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where C is a certain constant universal for all the elements, 
and ken is the chemical equivalent and is equal to 


Ma 


keh == 103 


Ilere ma is the atomic mass of the element (in kg/mol), and z 
is its valency. 

1 m, 
Ilence, k=, where F= 10-3/C is Faraday’s constant 
(Appendix II). 
22.1.4 Faraday’s united law of electrolysis, 


| 13i m 
E Q 
F z f 


enables the physical meaning of F to be cleared up: at M = 
- malz, Faraday’s constant F = Q. The amount of substance 
equal to 1/z moles is called its gram equivalent. At z = 1 the 
cram equivalent of 4 substance is equal to one mole. Faraday’s 
constant is numerically equal to the electric charge that must 
be passed through an electrolyte to deposit one gram equivalent 
of any substance on the electrode. 
22.1.5 The splitting of neutral molecules into oppositely charged 
ions as a result of interaction between the dissolved substance 
(solute) and the solvent is called electrolytic dissociation. It is 
caused by the thermal motion of the polar molecules (18.1.4) 
of the solute, consisting of interrelated, oppositely charged ions 
(39.4.3), and by the interaction of these molecules with tho 
polar molecules of the solvent. Both causes lead to weakening 
of the heteropolar bonds in ionic molecules (39.4.3) and to the 
conversion of such molecules into two oppositely charged ions. 
The degree of dissociation œ is the ratio of the number n’ of 
molecules dissociated into ions in a certain volume to the total 
Boa na Of molecules of the solute in the same volume: @ =: 
= n hy 
22.1.6 The process opposite to electrolytic dissociation (22.1.5} 
is called deionization. It consists in the recombination of ions 
of opposite signs into neutral molecules. Under conditions of 
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dynamic mobile equilibrium between the dissociation and 

deionization processes, œ is determined by the equation 

1-—a 
2 


= const X no. 


As ny > 0, we have a — 1, i.e. in weak solutions almost all 
the molecules dissociate. The degree of dissociation œ decreases 
with an increase in the concentration of the solution. In highly 
concentrated solutions 


const 


Ving 


an 


22.2 Atomicity of Electric Charges 


22.2.1 It follows from Faraday’s laws of electrolysis that all 
electric charges consist of a whole number of elementary indi- 
visible charges. 

22.2.2 The charge Q of any ion is equal to 


zF 


=e 
x n NA ? 


where z is the valency of the ion, F is Faraday’s constant 
(22.1.4), and Nag is Avogadro’s number (Appendix II). The 
charge of a monovalent ion is equal to the charge e of the elec- 
tron or proton: 


Q, = e = 1.602 x 10719 C = 4.803 XxX 107! electrostatic units. 


Any electric charge consists of a whole number of elementary 
charges e (14.1.2). 


42.3 Electrolytic Conduction of Liquids 


22.3.4 The current density j (20.2.3) in an arbitrary section SS, 
perpendicular to the direction of ion motion (Fig. 22.1), is 
equal to the sum of the current densities of the positive and 
negative ions: 


J=)++}-. 
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Here j} = ging, (v4) and j- = g_ny_ (v_), where g, and 
4, Hop and no-, and (v) and (v_) are the charges, concen- 
trations and average velocities of ordered motion (drift due 
to the action of the electric field) 
of the positive and negative ions. 
22.3.2 The average velocities of 
ion drift are proportional to the 
electric field strength E: 


(v;) =u,E and 
(v) = —u_E, 


where u4 and u. are the mobilities 
of the positive and negative ions. 
The mobility of an ion is equal to 
the ratio of the magnitude of the 
average drift velocity vector to 
that of the field strength vector. 
The mobility is independent of the 
Held strength vector E. Since there are no bulk charges in 
electrolytes, qno} F q-no- = 0. 


Fig. 22.4 


Moreover, q4} = ez, == 4 (22.2.2): 
A as 
22.3.3 Ohm’s law of current density in electrolytes (cl. Sect. 20.3.4) 


IS! ` 


hd 


j= Wa 24No4 (U+ + u_) E. 


The resistivity pọ of an electrolyte (20.3.4) is equal to 


p= - 
Fino, (Uy Uu_) * 


If a molecule of the solute dissociates into ky positive and k_ 
negative ions, then 


kyz = kz.; no+ = kany and ng- = kang, 
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where a is the degree of dissociation, ny is the concentration 
of the solute (22.1.5) and 

= NA 
p = Fzikðng (u+ u) ’ 


The ratio Na/z, is the number of positive ions in one gram 
equivalent (22.1.4). We introduce the quantity 


C= k noz4 = k ngoz- 


NA Na ’ 


which is called the equivalent concentration of the solution. 
It is the number of gram equivalents of the ions of one sign 
contained in unit volume of the electrolyte (cither in the free 
state or bound in the molecules). Then 


1 
P= Fla u, Fu * 


22.4 Electrical Conduction in Gases 


22.4.1 Gases, consisting of neutral atoms and molecules, are 
insulators and do not conduct an electric current. Gases are 
capable of conduction after they are ionized. 

The ionization of a molecule (or atom) consists in the detachment 
of one or several electrons, after which the molecule (or atom) 
is converted into a positive ion. If a molecule (or atom) gains 
electrons, it becomes a negative ion. 

- The reverse process, in which electrons join a positive ion to 
form a neutral molecule (or atom), is called recombination. 
22.4.2 To ionize a molecule (or atom) it is necessary to perform 
the work of ionization W; to overcome the forces of attraction 
between the electron being detached and the other particles 
of the molecule (or atom). The work W; depends upon the energy 
state of the electron being detached (39.1.9) in the atom or 
molecule of the given gas. The ionization energy increases 
with the electron detachment factor, i.e. with the number of 
electrons detached from each atom. 

22.4.3 The ionization potential Vi is the potential difference 
that the charged particle must traverse in an accelerating elec- 
tric lield in order to accumulate energy equal to the work of 
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ionization W;,.Thus Vi = Wi/e, where e is the magnitude of the 
charge of the particle. 

2%.4.4 A gas is ionized by external effects: raising the tempera- 
(ure sufficiently, the action of various kinds of radiation, cosmic 
radiation, bombardment of the molecules (or atoms) of the gas 
by high-speed electrons or ions, etc. The intensity, or rate, of 
nization is measured by the number of pairs of particles of 
opposite sign formed in unit time in unit volume of the gas. 
22.4.5 Collision ionization is the process in which a gas is ionized 
hy the action of fast-moving electrons or by ions. The minimum 
kinetic energy (3.2.1) that an ionizing particle must have is 
assessed by applying the laws of conservation of momentum and 
energy. Thus 


mv? m 
—W -7 | 


where W; is the work of ionization, m is the mass of the electron 
and M is the mass of the atom. 

The smaller the ratio m/M, the closer this energy is to Wi. An 
electron and a single-charge ion accumulate the same energy 
k = e AV if they traverse the same potential difference AV. 

It follows from the preceding formula that for collision ioniza- 
tion by electrons and ions, the latter of a mass 104 times that 
of the electron, ions must travel across a greater potential 
difference in an accelerating field than electrons. 


22.5 Various Types of Gas Discharges 


22.5.1 A gas discharge is the process of current conduction through 
a gas. 

A semi-self-maintained gas discharge is one caused by external 
ionizing agents (22.4.4) and ceasing when the ionizing action 
stops. The curve in Fig. 22.2 shows the dependence of the cur- 
rent J in a semi-self-maintained discharge on the voltage U 
across the electrodes. In the 1st region of the curve, at low volt- 
ages, Ohin’s law, similar to the one for electrolytes (22.3.3), 
is valid. If electrons and monovalent ions are formed in the 
gas, then 


j = eng (u4 F u) E, 
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where nọ is the number of pairs of oppositely charged particles 
in unit volume, u, and u_ are the mobilities of the positive 
and negative ions, and e is the magnitude of the charge of the 
electron. In a wide pressure range, from 10 to 107 Pa, the mo- 
bility of gaseous ions is 
inversely proportional to the 
pressure. 

22.5.2 The relation between 

the current J and the vol- 

tage U is no longer linear 
in the 2nd region of the 
curve in Fig. 22.2 because 
the ion concentration decrea- 
ses in the gas. In this region 
the current increases slower 
and slower with an increase 

in U. 

Fig. 22.2 Beginning with a certain 
voltage Us, the current re- 
mains constant in the 3rd 

region of the curve in Fig. 22.2 when the voltage is in- 
creased. This occurs because, at a constant rate of ionization 
(22.4.4) in strong electric fields, all the ions formed in unit 
time in the gas reach the electrodes. No further increase in the 
current takes place at a fixed rate of ionization. Saturation 
results because all the charged particles produced in the gas 
travel in a strong electric field and reach the electrodes before 
an appreciable part of them have enough time to recombine with 
particles of opposite charge. 

The saturation current J, is the inaximum current that can be 

obtained at a given rate (intensity) of ionization: 


Ig => eN p» 


where NV, is the maximum number of pairs of monovalent ions 
formed in the gas in unit time at a given rate of ionization. 
The iinear relation between J; and N, confirms the ionic nature 
of electrical conduction in gases. 

22.5.3 A self-maintained gas discharge is one that continues 
after the action of the external ionizing agent ceases. To obtain 
such a discharge it is necessary for new pairs of oppositely char- 
ged particles to be formed continuously in the gas. The main 
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source of these particles is collision ionization (22.4.5). At 
a certain suflciently high voltage across the electrodes, the 
electrons are so highly accelerated by the electric field that 
their energy proves adequate to ionize the gas molecules 
(22.4.5). This is called volume ionization. Secondary electrons, 
accelerated in the electric field, also ionize the gas molecules. 
As a result, the number of current carriers in the gas and its 
electrical conductance are greatly increased (4th region in 
lig. 22.2). But, by itself, ionization by the action of electrons 
is insufficient to produce a self- 

maintained discharge. Electrons hg 

travelling in the direction from 
(he cathode toward the anode 
ionize the molecules of gas locat- 
ed closer to the anode than the 
place where the electrons are 
emitted. If the energy of the 
positive ions is insufficient for 
collision ionization of the gas 
molecules or for knocking 
electrons out of the metallic 
cathode (surface * ionization), z; 

electrons a be produced near Mge aae 

the cathode of a gas-discharge 

tube only by external ionizing agents. When the action of 
these agents is stopped, the region of collision ionization 
by electrons contracts toward the anode as the electrons 
move in this direction. Under these conditions the gas 
discharge is extinguished as soon as collision ionization ceases. 
22.5.4 Surface ionization (22.5.3), initiated at high voltage U, 
produces secondary elecirons and sets up a two-way avalanche 
of electrons and positive ions. After this the action of the exter- 
nal ionizing agent is no longer essential for continuing the gas 
discharge. When the voltage U across the electrodes of a gas- 
discharge tube is raised sufficiently, an electric breakdown of the 
gas occurs. This is the transition from a semi-self-maintained 
to a self-maintained gas discharge. The voltage U = Ubq, 
corresponding to electric breakdown, is called the ignition 
potential, or breakdown voltage. The breakdown voltage for 
a gas discharge in a tube having flat parallel electrodes at the 
distance d from each other depends upon the product pd, where 
p is the gas pressure (Fig. 22.3). In addition, Upq depends 


O —— 


pa 
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on the chemical nature of the gas and the material of the 
cathode. 

Self-maintained gas discharges are classified into glow, corona, 
spark and arc discharges. They are not discussed in the present 
handbook. 


22.6 Certain Information on Plasma 


22.6.1 Plasma is a special state of matter, whose most important 
property is a predominant, or even complete, ionization of the 
particles of matter. The degree of ionization a is the ratio of the 
number of ionized particles to their initial number. Plasma is 
classified as weakly ionized (when a is a fraction of one per cent), 
moderately ionized (when a equals several per cent) and fully 
ionized (when œ is close to 100 per cent). 

Weakly ionized plasma is found in the ionosphere, the conducting 
layer of the atmosphere. The ionosphere extends from 60 to 
2 X 10* km above the earth’s surface. 

Fully ionized plasma, formed at ultra-high temperatures (high- 
temperature plasma), exists on the sun and hot stars. Under 
laboratory conditions, plasma is produced in gas discharges 
(22.5.4) and gascous-discharge lamps. Accelerated plasma is 
employed as the working medium (11.1.1) in jet engines. Plasma 
can also be used for the direct conversion of internal energy 
into electrical energy (magnitohydrodynamic generators and 
plasma sources of electrical energy). 

The great number of charged particles in a plasma provides for 
its high electrical conductance; in this aspect its properties 
approach those of conductors of electric current. 

22.6.2 A condition for the existence of a plasma is a certain 
minimum density Pmin of charged particles. Beginning with 
this density, we can speak of a plasma rather than a simple 
accumulation of separate charged particles. The density Omin 
is determined by the inequality L > D, where L is a linear 
dimension of the system of charged particles and D is a typical 
“plasma” parameter, the length called the Debye-Hiickel screen- 
ing radius (it is also called the Debye shielding length or Debye 
length): 


PEN 
-~ ANG9N; 


b= | kT; 


i 


SATS: a: ; 
) (in cgse units), 
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qini \ -1/2 N 
D = ( > Eh; (in SI units), 


i 


where q; is the charge of the ith species of particles, n; is their 
concentration, 7; is their temperature, k is Boltzmann’s con- 
stant (8.4.5) and £ is the electric constant. Summation is to be 
carried out over all the species of particles. The quantity D 
is the distance over which the Coulomb field of any charge of 
the plasma is shielded. The shielding occurs because any charge 
is predominantly surrounded by oppositely charged particles. 
According to a more exact definition, a plasma is a quasi- 
neutral assembly of a large number of charged particles that 
occupy a region of space with the linear dimensions L >> D. 
The violation of quasi-neutrality in the plasma is eliminated 
by the strong electric fields set up in it. The Debye-Htckel 
screening radius characterizes the interaction of the particles 
in the plasma. It was found that E/E, N?/*, where N = 
= 4nnD?/3. Here Ep is the potential energy of interaction be- 
tween two particles located at an average distance of n~! from 
cach other (n being the concentration of particles), and Ek 
is the kinetic energy of these particles. The quantity N is the 
total number of charged particles inside a sphere of radius D, 
and is called the Debye number. If N is large, the plasma is 
said to be gaseous and can be dealt with thermodynamically as 
an ideal gas with the state equation p = nkT (8.4.5). 

22.6.3 Coulomb long-range interaction of charged particles in 
a plasma leads to its qualitative uniqueness that enables it 
to be regarded as a special, fourth state of aggregation of matter. 
The most important properties of plasma are: 

(a) its strong interaction, due to its high electrical conduction, 
with external magnetic and electric fields; 

(b) specific collective interaction of particles in the plasma (by 
means of a special field whose nature and origin is beyond the 
scope of this handbook); 

(c) the fact that due to long-range interaction it is a special 
kind of elastic medium in which various types of vibrations 
and waves can be readily excited and propagated. 

22.6.4 The motion of plasma in a magnetic field is employed 
to directly convert the internal energy of an ionized gas into 
electrical energy. This method is made use of in a magneto- 
hydrodynamic (MHD) generator. This generator operates as 
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follows: gas produced in the combustion of fuel and constituting 
a plasma passes through a strong traverse magnetic field. In 
a conducting plasma, as in any conductor moving in such 
a field, electromagnetic induction (24.1.1) occurs. The induced 
emf can be taken off by means of electrodes in the external 
circuit. A schematic diagram of an MHD generator is shown in 
Fig. 22.4. The ionized gas 
produced in fuel combustion 
passes through the nozzle 
and its internal energy is 


: ooo . jae . 
O Rr eZ converted into kinetic ener- 


l Dz gy. As this gas passes through 


ae 
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=9-> as 
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field B, the positive ions 
move toward the upper elec- 
trode by the action of the 
induced electric field (24.1.1) 
Fio. 22.4 and the free electrons move 
pran: toward the lower clectrode. 
When we connect the elec- 
trodes to an external load, a current is obtained in the circuit. 
22.6.5 A state of thermodynamic equilibrium is possible at a 
definite temperature in the plasma when the loss of charged 
particles due to recombination (22.4.1) is recoinpensed by new 
acts of ionization. The average kinetic energies of the various 
particles making up such a plasma are the same. The processes 
of energy exchange between the particles of this plasma, as 
well as the energy exchange of the plasma with blackbody 
radiation (36.1.8) are equilibrium processes (8.3.7). A plasma 
with the aforesaid properties is said to be isothermal. It exists 
in the atmospheres of high-temperature stars. 
A condition required for a high degree of ionization of a thermo- 
dynamically equilibrium plasma, consisting of two kinds of 
charged particles, equal in magnitude and opposite in sign, is 
the maximum reduction in the recombination (22.4.1) of the 
particles. A completely ionized plasma can be obtained at 
kT >> eVi, where V; is the ionization potential (22.4.3) of the 
gas atoms and kT is the average energy of thermal motion of 
the plasma particles. For hydrogen and deuterium this corre- 
sponds to T x 160 000 K. Essential under these conditions are 
the radiation of the plasma and the difficulty of isolating the 
plasma from the walls of its container (22.6.7). 
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22.6.6 In the plasma of a gas discharge (22.5.1) (gas-discharge 
plasma) thermodynamical equilibrium (22.6.5) is absent. The 
charged particles of such a plasma are in an accelerating electric 
field. 

The average kinetic energy of the electrons in a gas-discharge 
plasma is characterized by a certain electron temperature Te 
corresponding to the Maxwellian energy distribution of the 
electrons (10.3.7). Owing to the lack of thermodynamical equi- 
librium in such a plasma, 7, is of a conventional nature. The 
average kinetic energy of the neutral particles is substantially 
less than that of the electrons. Besides the electron tempera- 
lure Te, the parameters of a gas-discharge plasma are: the 
electron concentration ne, the number of ionizations per elec- 
tron per second, the density of the ion or electron current, and 
the longitudinal electric field strength EF, established along 
the symmetry axis of the plasma. 
22.6.7 The existence of a thermodynamically nonequilibrium 
vas-discharge plasma is made possible by the energy of the 
discharge current passing through it. In the absence of an exter- 
nal electric field, the gas-discharge plasma disappears. 

The disappearance of an unmaintained gas-discharge plasma 
that has been left to itself is called deionization of the gas. 

In addition to the ionization and recombination processes 
(22.4.1), a large share of the energy balance of a plasma that 
exists in a limited volume consists in the interaction between 
the plasma and the walls confining its volume, as well as the 
radiation of the plasma and the transport of radiation in it. 
The diffusion of the charged particles to the walls and their 
recombination on the walls, and the transfer of energy to the 
walls by heat conduction (10.8.2) contaminate the plasma with 
impurities and reduce its energy. To avoid these phenomena, 
measures are taken to confine the plasma by a magnetic field 
to prevent contact with the walls. 

Radiation of the plasma in the optical range and in the far 
ultraviolet consists of: braking radiation (bremsstrahlung) 
of the electrons that occurs when they are retarded by the ions, 
ordinary radiation produced by excited particles, and recom- 
bination radiation that occurs in the recombination process 
(22.4.1). In a magnetic field, a plasma also has a special beta- 


tron (synchrotron) radiation, which is beyond the scope of this 
handbook. 
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CHAPTER 23 MAGNETIC FIELD 
OF DIRECT CURRENT 


23.1 Magnetic Field. Ampere’s Law 


93.44 A magnetic field is one of the forms of an electromagnetic 
field (15.1.2). A magnetic field is set up by moving charged 
particles, as well as moving bodies carrying electric charges. 
Magnetic fields act only on moving electric charges and moving 
charged bodies. 

Variable electric fields (displacement currents) (27.3.1) are 
also sources of magnetic fields. 

23.1.2 The main force characteristic of a magnetic field is 
vector of magnetic induction B (also called magnetic flux density, 
magnetic displacement, or, simply, magnetic vector). Vector B 
is determined by one of three methods: 

(a) from Ampere’s law (23.1.4); 

(b) from the effect of the magnetic field on a current-carrying 
loop (23.4.2); 

(c) from the expression for the Lorentz force (24.1.3). 

/23.1.3 Magnetic fields are represented graphically by lines of 
magnetic induction. Lines of magnetic induction (lines of magnet- 
ic force) are curves whose tangent at each point coincides with 
vector B at this point of the field. The directions of the magnetic 
vector and the lines of magnetic induction are determined by the 
corkscrew (Maxwell) rule (also called the right-hand screw rule): 
if a corkscrew with right-hand thread is screwed in the direction 
of the current density vector in the conductor (20.2.3), the 
direction that the corkscrew handle turns indicates the direc- 
tion of the lines of magnetic induction and the magnetic vec- 
tor. 

Lines of magnetic induction are not interrupted at any point 
of the magnetic field, i.e. they neither begin nor end. These 
lines are either closed curves or they extend from infinity to 
infinity or they are wound infinitely on some surface, covering 
it densely all over, but never returning again to any point 
of the surface. A similar situation is observed, for example, 
in a field set up by a system of a circular current and an infinite 
rectilinear current passing through the centre of the circular 
current and perpendicular to its plane. 
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A magnetic field is said to be uniform, or homogeneous, if vec- 
lor B is constant at any point in the field. Otherwise, it is said 
lo be nonuniform (nonhomogeneous). 

23.1.4 The force exerted by a magnetic field on a current-carry- 
ing conductor placed in the field is called A mpere’s force. 
Ampere’s law states that an element of force dF exerted on a 
small element of length di of a conductor, carrying a current 
and located in a magnetic field, is proportional to the vector 
product of the element dl of conductor length by the magnetic 
induction B: i 


dF = J [dl B] (in SI units), a 


a = = [al B] (in Gaussian units, see Appendix I). 


lere dl is a vector with the magnitude dl and has the same 
direction as the density vector j of the current in the conduc- 
tor (20.2.3). 

Ampere’s force F, acting in a magnetic field on a current-carry- 
ing conductor of finite length, is 


F= \ I [dl B], 


in which integration is carried out over the whole length of the 
conductor. In the case of a uniform magnetic field (23.1.3) 


F= IBlsing& (in SI units), / 


F =< Bilsina (in Gaussian units), 


where a is the angle between the density vector of the current 
in the conductor and vector B. The relative positions of vectors 
dF, B and dl are shown in Fig. 23.1. If dl is perpendicular to B, 
the direction of force dF is found by the left-hand rule: if the 
left hand, opened flat with the fingers parallel to one another 
and the thumb perpendicular to the fingers, is held so that the 
magnetic induction vector enters the palm and the fingers 
indicate the direction of the electric current, the thumb indicates 
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the direction of the force exerted by the magnetic field on the 
current-carrying conductor. It is evident from Fig. 23.1 that 
vector dF is perpendicular to the plane passing through vec- 
tors dl and B, and that its di- 
rection is such that from the 
head of vector dF the direction 
of shortest rotation from vec- 
tor dl to vector B is counter- 
clockwise. In other words, vec- 
tor dF coincides in direction 
with the vector product [dl B]. 
23.1.5 It follows from Ampere’s 
law that the magnetic induction 
vector inSI units is numerically 
Fig. 23.1 equal to the limit of the follow- 

ing ratio: the force exerted by 

a magnetic field on an element 
of a current-carrying conductor to the product of the current 
by the length of an element of the conductor, the limit being 
taken as the length of the element approaches zero and is posi- 
pene in the field so that the limit reaches its maximum value. 
Thus 


1 (dF 

em T (T ) max ° 

In Gaussian units (Appendix I), 
c {dE 

B= + (F max’ 


where c is the electrodynamic constant (Appendix I). 

23.1.6 In contrast to electrostatic forces (14.2.2), Ampere’s 
force is not a central one (3.3.4). Ampere’s force is perpendic- 
ular to the lines of magnetic induction and to the current- 
carrying conductors. 


23.2 The Biot-Savart-Laplace Law 


23.2.4 The Biot-Savart-Laplace law determines the magnetic 
induction at any point in a magnetic field set up by a direct 
electric current flowing along a conductor of any shape. The 
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vector dB of magnetic induction at any point C of a magnetic 
field, set up by an element of a conductor carrying the current T 
and of the length di is determined by the formula 


dB= kur 5 [dl r], 


where dl is the vector of an element of length of the conductor 
(23.1.4), r is the radius vector from the element dl of the con- 
ductor to point C (Fig. 23.2), r 
is the magnitude of the radius 
vector r, k is a factor depending 
only upon the chosen system of 
units, and p, is a dimension- 
less quantity that characterizes 
the magnetic properties of the 
medium and called the relative 
magnetic permeability (or simply 
relative permeability) of the me- 
dium. The relative permeability 
does not depend upon the choice 
of units and equals unity for vacu- 
um. For all substances, except fer- 
romagnetic ones, u, differs only 
slightly from unity (26.5.1). See Fig. 23.2 

Sect. 26.4.5 for a general defini- 

tion of u, 

23.2.2 In SI units k = p,/4n, where p = 4m X 10-7 henry/m 
is the permeability of free space, or the magnetic constant (Appen- 
dix II). In Gaussian units, k = 1/c, where c = 3 x 101° cm/s 
is the electrodynamic constant (Appendix I). 

The Biot-Savart-Laplace law is of the form 


— Hour A. - 
dB = r [dalr]. 


This form of the Biot-Savart-Laplace law and of all the electro- 
magnetic feld equations is said to be rationalized. 

The product p jt, is sometimes called the absolute magnetic 
permeability (or simply permeability) of the mẹdium, 
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-In Gaussian units dB = = = [dl r]. The magnitude of vector 


dB is equal to 
por I dlsina 


dB = Pe (in SI units), 
aps- iina (in Gaussian units), 


where œ is the angle between vectors dl and r. 

23.2.3 The magnetic field strengih H is the vector characteristic 
of a magnetic field that, for a homogeneous isotropic medium, 
is related to B as follows: 


H = (in SI units), 


Hour 


H=— (in Gaussian units). 
r 


The universal relation between vectors B and H for a magnetic 
field in an arbitrary medium and the more general definition 
of E magnetic field strength vector H are discussed in Sect. 
26.4.4. 

The strength of a magnetic feld set up by a direct current in 
a homogeneous isotropic medium is independent of the magnetic 
properties of the medium. Thus 


I I dising ,. . 
dH = VE [dl r], a (in SI units), 
je Gly, CH Ga Gaai, 
er? cr? 


where œ is the angle between vectors dl and r. 

23.2.4 It follows from a comparison of the characteristics of 
electric (E and D) and magnetic (B and H) fields that the electric 
feld strength vector E is analogous to the magnetic vector B. 
Both vectors specify the force effects of the fields and depend 
-upon the properties of the medium in which the field is set 
up. 

The analogue of vector D of electric displacement (15.3.1) is 
the magnetic field strength vector H, ` | 
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23.2.5 An electric charge q, travelling in an unbounded homo- 
weneous and isotropic medium at the constant velocity v, sets 
up a magnetic field whose magnetic induction B, can be cal- 
culated by the formula” 


: Mor qu sin 
By = +t lv r], Bq = Hair suey (in SI units), ` 


L qv sin 
n= Live, Bet gq sin B 


z (in Gaussian units), 
c r 


where i. is the angle between vectors v and r, and r is the radius 
vector from the travelling charge to the point A being considered 
in the field. Vectors B, and Hg are perpendicular to the plane 
passing through vectors v and r. 

When q > 0, the shortest rotation from vector v to vector r 
is seen from the head of vector B, (and H,) to be counter- 


A By 


(ct) 


Fig. 23.3 


clockwise (Fig. 23.3a). When q < 0, then Bg (and Hg) have the 
opposite direction (Fig. 23.30). The magnetic field of a moving 
charge is variable, because the radius vector r varies in mag- 
nitude and direction as charge q travels even if v = const. The 
magnetic field of a moving charge, depending upon angle 8 
between vectors v and r, is not spherically symmetrical, as in 
the case of the electrostatic field of a point charge (14.2.3). 
The magnetic field being considered has mirror symmetry with 
respect to the direction of v. The field strength is maximum at 
points in a plane passed through the charge perpendicularly 
to vector v (under the condition that v < c). At all points on 
a straight line coinciding with vector v there is no magnetic 
field induction, 
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23.3 Simplest Cases of Magnetic Fields Set Up by 
Direct Currents 


23.3.1 Making use of the Biot-Savart-Laplace law we can deter- 
mine the characteristics (B and H) of a magnetic field set up by 
an electric current flowing in a con- 
ductor of finite dimensions and arbi- 
trary shape. According to the prin- 
ciple of superposition of fields (45.2.2) 
the magnetic induction B at an arbi- 
trary point of a magnetic field ofa 
conductor carrying the current J is 


equal to 
B= f dB 
L 


where dB is the magnetic induction 
of the field set up by an element 
of the conductor with the length dl. 
Integration is carried out over the 
whole length Z of the conductor. 
Fig. 23.4 23.3.2 A straight conductor MN car- 

rying the current I sets up a mag- 
netic field with induction B and strength H at an arbitrary 
point A equal to* 


(in SI units), 


— Bour I L —2 
B= - (cos @,—cos@,), H TT 


B= = Ur za (cos pı — Cos Pa), H= Ea (in Gaussian units), 
c To Mr 


where r, is the distance from point A to the conductor, @, and 
z are the angles formed by the radius vectors from point A to 
the beginning and end of the conductor (Fig. 23.4), p, is the 


* It is assumed in all the examples of Sect. 23.3 that the medium 
is homogeneous, isotropic and completely fills all the space in 
which the magnetic field exists, | 
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relative magnetic permeability of the medium, and pọ is the 
magnetic constant in SI units (Appendix II). 
lor an infinitely long conductor (9, = 0 and g, = 7) 


pate. fae” a Sranits, 


4n To 7 uour 
B a Ur 2L ; pe (in Gaussian units). 
K To Mr 


23.3.3 For the magnetic field at the centre of a rectangular turn 
(loop) carrying the current I 


n2-+- p2 
phr 8I Vato (in SI units), 


An ab ' — Hor 
72 p2 
p= „SV OTR Ha. (in Gaussian units), 
c ab Ur 


where a and b are the sides of the rectangle. 
23.3.4 The magnétic momeni Pm of a closed loop of arbitrary 
shape, carrying the current J, is equal to 


Pm = 1 \ n dS, (in SI units), 
S 


Pm=47 \ ndS (in Gaussian units), 
S 


where n is a unit vector of the outward normal to element dS 
of the surface S, bounded by the current-carrying loop. In the 
case of a flat loop, surface S is plane and all the normals have 
the same direction. Hence 


Pm=/JSn, pm=ZJS (in SI units), 


( 
4 1 : ; ; 
Pin ems Pm= z 1S (in Gaussian units). 
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Vector p,, is directed so that from its head the current in the 
circuit is seen to be flowing counterclockwise (Fig. 23.5). 


B 


+ Pir 


Fig. 23.5 Fig. 23.6 


23.3.5 The magnetic field set up by a circular turn, or loop, 
carrying the current /, at an arbitrary point A on the axis of 
the turn (loop) (Fig. 23.6), is specified by 


— Por __ 2Pm _ B ;, ; 
B= 4n (RIF hA’ H = ia (in SI units), 
— 2Pm — B ba Ld e 
B=pry (READE H= ia (in Gaussian units). 


Here Pm is the magnetic moment of a current-carrying turn, 
or loop, (23.3.4). 

The magnitudes of vectors B and H equal 

B= obr IR? uour S 


2 (REFh 2n (RF h2)’ 


H = 2 (in SI units), 
Holr 
pa 2ntR2 41 2IS 
Sy MR pepe o "(REF MPA 


= (in Gaussian units), 
Hr 
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where k is the distance from point A to the centre of the turn 
(loop), R is the radius and S is the area of the turn. 

23.3.6 The magnetic field at the centre of a circular turn, or 
loop, has the characteristics (23.3.5): 


Hour 2Pm raa BO oga . 
B Zn R? ’ H = Ter (in SI units), 
2Pm g OS i , í 
B= pr R3’ H= ir (in Gaussian units). 
The magnitudes of vectors B and H equal 
21 I 1 2al ; : 
me Up R’ z TNR (in Gaussian units). 


‘he direction of the magnetic field is along the axis of the loop 
and perpendicular to its plane (Fig. 23.6). 


Fig. 23.7 


23.3.7 A toroid is an annular coil wound on a core having the 
shape of a torus (Fig. 23.7). The magnetic field of a toroid is 
completely confined within its volume, 
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The characteristics of this magnetic field are calculated by the 
formulas 


NI NI 


B = uour Dae? LTA (in SI units), 
pane. ees (in Gaussian units). 
c i ec r ` 


The magnetic induction B and the strength H of the magnetic 
field on the circular centre line of the toroid are equal to 


NI : : 
Bo = polr Ra Uorn, H=nl (in SI mente 


Here N is the number of turns of the toroid, J is the current 
in the turns, r is the radius of a circle in the toroid, Rel = 
= (R, + R,)/2, R, and R, are the external and internal radii 
of the torus, and n is the number of turns per unit length of 
the toroid’s centre line. 

23.3.8 A solenoid is a cylindrical coil consisting of a large number 
of turns of wire in the form of a helix. If the turns are wound 
tight up against or suficiently close to one another, the solenoid 
is dealt with as a system of circular closed current-carrying 
loops connected in series and having the same radius and a 
common axis. 

The magnetic moment Pm (23.3.4) of the solenoid is equal to 
ie vector sum of the magnetic moments of all N of its turns. 
Thus 


Pm = NISn, 


where J is the current in the solenoid’s turns, S is the area of 
a turn, and n is a unit vector of the normal to the plane of a 
turn. Vector p,, is along the axis of the solenoid and coincides 
in direction with the magnetic field as determined by the cork- 
screw rule (23.1.3). 

The magnetic induction B and strength H of the solenoid’s 
magnetic field at an arbitrary point A, lying on its axis, are 
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numerically equal to 


He. bolt ai (COS G.— COS @,), fP2S 2 (in SI units), 
2 Holir 
fi O B. cesta 
Bo 2nurnT (COS Q — cos &,), en (in Gaussian units), 
r ‘ 


where n = N/L is the number of turns per unit length of the 
solenoid, & and a, are angles at which the ends of the solenoid 
are seen from point A (a, < a), 


A Lei 


cos Q = — — H, COS a, = mM 
t ? VREF’ 


/, is the length of the solenoid (Fig. 23.8), and R is the radius 
of the cylindrical coil. 


Fig. 23.8 


23.3.9 Under the condition chat L >> R, the magnetic field 
inside the solenoid at points on its axis sufficiently distant 
from its ends is characterized by the values 


B=pournI, H=nl (in SI units), 


B=+ 4tpynl, H= 1 4nnI (in Gaussian units). 
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The magnetic induction B and strength H of the magnetic field 
of a solenoid of sufficient length, at points of its axis coinciding 
with its eids, are equal to 


pater py 


4 n ; 
9 i H=5 m (in SI units), 


B=— 2nd , H=—20n! (in Gaussian units). 


23.4 Interaction of Conductors. Effect of a Magnetic 
Field on Current-Carrying Conductors 


23.4.1 Ampere’s force (23.1.4), which acts on an element dl 
of length of a straight conductor carrying the current Z}, and 
is exerted by a long straight conductor carrying the current J, 
located parallel to the first conductor at a distance of a, is 
numerically equal to 


ape Male et a in Si watts): 
AN a 


| i ots 


-z dl (in Gaussian units), 
where uo is the magnetic constant in SI units (23.2.2) and c 
is the electrodynamic constant (Appendix I). The force dF 
is a force of magnetic interaction (24.1.4). 

It is assumed that the lengths of the conductors are many times 
greater than the distance a between them, and that element dl 
is located far from the ends of the first conductor. It is moreover 
assumed that the conductors are in a homogeneous isotropic 
medium having the relative magnetic permeability p,. 

The force F exerted on a current-carrying conductor of length l 
is equal to 


p= botr fll y (in SI units), 


1 211I 
n 


l (in Gaussian units). 
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Conductors carrying currents J, and J, that flow in the same 
direction attract each other. If the currents flow in opposite 
yer the conductors repulse each other (see also Sect. 
24.1.4). 

Bak closed flat current-carrying loop, of arbitrary geometric 
shape, placed in a uniform magnetic field (23.1.3), is subjecf 
lo a moment of force (t orque) M equal to 


where Pm is the magnetic moment vector of a current-carrying 
Joop (23.3.4), and B is the vector of magnetic induction (23.1.2) 
of the field. Torque M is perpendicular to the plane passing 
\lhrough vectors Pm and B in such a manner that the shortest 
rotation from pm to B is seen from the head of vector M to be 
counterclockwise. The torque tends to turn the loop to a position 
of stable equilibrium, in which vectors p,, and B are parallel 
lo each other. 

The preceding formula provides for a definition of the magnetic 
induction B (23.1.2): the magnitude of the magnetic induction 
vector at a given point of a uniform magnetic field is equal to 
the maximum value of the torque Mya, acting in the vicinity 
of the point on a small flat closed current-carrying loop, having 
a magnetic moment pm of unit magnitude: 


M max M max $ > 

= —— i S 
B ir Ts (in SI units), 
p=Mmax _ Mmax (in Gaussian units). 


23.4.3 When a closed current-carrying loop is placed into a 
nonuniform magnetic field (23.4.3) in which the magnetic 
induction B of the field varies over distances comparable to the 
linear dimensions of the loop, the loop is subject to the force F 


equal to 


OB OB OB 
F= Dmx pz T Pmy Sy TPM Gp ; 


where Pmx, Pmr and Pmz are the projections of vector Pm on the 
axes of Cartesian coordinates. In particular, when vector B 
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is along the OX axis and depends only on coordinate zx [i.e. 
B, = B (z), B, = B, = 0 and B = B,i] then 


dB , 
F= Pmx Ta l 


By the action of force F an unfastened current-carrying loop 
is altracted into a region with a stronger magnetic field. 


23.5 Total Current Law. Magnetic Circuits 


23.5.1 The circulation of the magnetic induction vector B along 
a closed circuit Z is an integral of the form: 


$ Bdl= & B dl cosa, 
L L 


where L is the closed loop of arbitrary shape, œ is the angle 
between vectors B and dl, and dl is the vector of an element of 
length of the loop in the direction it is circuited. 

23.5.2 The total current law for a magnetic field in vacuum states: 
the circulation of the magnetic induction vector of a magnetic 
field along a closed loop in vacuum is proportional to the alge- 
braic sum of the currents bounded by the loop. Thus 


n 
$ Bdl=p Y In (in SI units), 
k=1 


L 
Z n 
T ; ; ; 
b Bdl=—— »> Ir (in Gaussian units), 
L k=1 


where uo is the magnetic constant in SI units (23.2.2), c is the 
electrodynamic constant (23.2.2) and n is the number of cur- 
rent-carrying conductors bounded by loop Ł of arbitrary shape. 
This law is valid for current-carrying conductors of any shape 
or size. 

In calculating the algebraic sum of the currents, a current is 
considered positive if from the head of the current density 
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vector (20.2.3) the circuiting of loop Z is seen to be counter- 
clockwise. Otherwise, the current is considered to be negative. 
For a generalization of the total current law to cover magnetic 
fields in an arbitrary media, see Sect. 26.4.2. 

23.5.3 In contrast to an electrostatic conservative field, in 
which the circulation of the strength E along any closed path 
equals zero (16.1.4), a magnetic field has a curl (it is called 
a rotational, vortical or circuital field). In such a field the cir- 
culation of the magnetic induction vector B of the field along 
a closed path is not equal to zero. If no currents are bound by 
loop L, the circulation of vector B along this loop equals zero. 
ae this does not alter the rotational nature of the magnetic 
field. 

23.5.4 The total current law can be written for either vacuum 
or for an arbitrary medium in the form of the circulation of 
field strength vector H (23.2.3) along an arbitrary closed loop Z 
that bounds the currents: 


n 
gias By I, (in SI units), 
hk=1 


9 rey eee ES, Ik (in Gaussian units). 
k=1 
If no currents are bounded by loop L, the circulation of vector H 
along this loop equals zero. 
23.5.5 The flux of the magnetic induction vector B (magnetic 


flux) through a surface element of area dS is the scalar physical 
quantity equal to 


dD, = B dS = Bp dS = B dS cos y, 

where dS = n dS, n is the unit vector of the normal to the 
surface of area ds, Bn is the projection of vector B on the direc- 
tion of the normal (Fig. 23.9), and y is the angle between vec- 


tors B and n. The magnetic flux My, through an arbitrary sur- 
face S is 


Wm = \ B dS = \ Bn dS. 
S S 
20—0186 
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In calculating this integral, the vectors n of the normals to the 
elements of surface dS should be all in the same direction with 
respect to surface S. Thus, if S is a 
closed surface, vectors n should be direct- 
ed either all outward or all inward. 
When the field is uniform and S is a 
plane surface perpendicular to vector 
B, then Bì = B = const and O, = 
= BS, 
Fig. 23.9 23.5.6 The Osirogradsky-Gauss theorem 
for a magnetic field states that the 
mene flux through an arbitrary closed surface equals zero. 
Thus 


È Bdas= 4 Bn ds =0. 
S S 


This theorem is a mathematical expression of the fact that in 
nature there are no magnetic charges at which lines of magnetic 
induction begin or end (23.1.3). 

The differential form of the Ostrogradsky-Gauss theorem for 
a magnetic field is one of Maxwell’s equations for an electro- 
magnetic field (27.4.2). 

23.5.7 A magnetic circuit is a set of bodies or regions of space 
to which a magnetic field is confined. For example, the internal 
region of a toroid (23.3.7) and that of an infinitely long solenoid 
(23.3.9) are magnetic circuits. To strengthen their magnetic 
fields, magnetic circuits are made of materials having a high 
relative magnetic permeability u, (23.2.1), such as iron. The 
design of magnetic circuits, which are the essential components 
of electrical machinery and such electric devices as transformers 
and electromagnets, is based on the laws of magnetic circuits. 
23.5.8 Ohm's law for a closed magnetic circuit (the Hopkinson 
formula) is 


Im 

Rm ’ 

where Pm is the magnetic flux, which is constant along each 
portion of the magnetic circuit, Fm = IN is the magnetomotive 


force (mmf), or magnetizing force (in SI units), N is the number 
of turns carrying the electric current J for magnetizing the 


On = 
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circuit, and Rm is the total€¥magnetic resistance (reluctance) of 
the circuit. The magnetic resistance of a magnetic circuit of 
length l; is 

l; 


l 
dl ; : 
m= | PTE (in SI units), 


where p, is the relative magnetic permeability of the given 
circuit, Up is the magnetic constant in SI units (23.2.2), and 
S is the cross-sectional area of the circuit. When S = const, 
lmi = l/pop,S (in SI units). : 
23.5.9 The total magnetic resistance Rm of the parts of a mag- 
netic circuit connected in series is 


n 
Rm = >; Rmi, 


i=1 


where Rm; is the magnetic resistance of the ith part of the mag- 
netic circuit, and n is the number of parts that make up the 
circuit. 

If n magnetic resistances are connected in parallel, the total 
magnetic resistance Rm of the circuit is 


= 4 

— n 
5 4 
i Rmi 
i=1 


23.5.10 A branch point of a magnetic circuit is a region of space 
or bodies where there are more than two possible directions of 
the lines of magnetic induction (23.1.3). 

Kirchhoff’s first law for branched magnetic circuits states that 
the algebraic sum of the magnetic fluxes in the branches that 
join in a branch point is equal to zero. Thus 


n 
> Dmi = 0, 
i=1 


where n is the number of branches joined at the branch point 
(cf. Sect. 23.3.1). 


20% 


Rm 
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A magnetic flux is considered positive if the lines of induction 
enter the branch point, and negative if the lines emerge from it. 
23.5.11 Kirchhoff’s second law for magnetic circuits states that 
in any closed loop (mesh of a network), arbitrarily chosen in 
a branched magnetic circuit (network), the algebraic sum of the 
products of the magnetic fluxes by the magnetic resistances of 
the corresponding portions of the circuit (23.5.8) equals the 
algebraic sum of the magnetomotive forces in this loop (23.5.8); 


k k 
>) OmiRmi= >) Fmi, 
i=1 


i=1 


where k is the number of portions that make up the closed loop 
(cf. 21.3.2). The fluxes Dy; and mmf’s Fm; are considered po- 
sitive if the directions of the corresponding lines of magnetic 
induction (23.1.3) coincide with the arbitrarily chosen direc- 
tion for circuiting the loop. 


23.6 Work Done in Moving a Current-Carrying 
Conductor in a Magnetic Field 


23.6.1 An unfixed current-carrying conductor is moved by 
Ampere’s force (23.1.4) when it is placed in a magnetic field. 
The element of work 6W* done by the Ampere force in moving 
an element, of length di, of a conductor carrying the current 
I is 


dw* =I d* (in SI units), 


éw* =I d®* (in Gaussian units), 


where d@* is the element of magnetic flux (23.5.5) through the 
element of surface dS generated by the element of the conductor, 
of length dl, in its small displacement, and c is the electrody- 
namic constant (23.2.2). 

23.6.2 In a small displacement of a current-carrying conductor 
of finite length, the work 5W done by the Ampere forces is the 
sum of the elements of work for all the elements of length of the 
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conductor. It is equal to the integral of 5W* calculated along 
the whole length 7 of the conductor. Taus 


sw = | raog = | dO* =I dM (in SI units), 
l 


wat I dÙm (in Gaussian units), 


where d®m is the magnetic flux through the surface generated 
by the conductor of length J in a small displacement. 

23.6.3 If a conductor of finite length / carries a constan tcurrent J 
and is moved a finite distance, the required work of the Ampere 
forces for this motion is 


W = Imn, 


where Mm is the magnetic flux through the surface generated 
by the conductor in its motion. 

23.6.4 The work done by Ampere forces in the displacement 
of a closed flat loop carrying the constant current J in a magnetic 
field from initial position 1 to final position 2 is equal to 


Wi, = I (Dm2 — Pm), 


where Dy; and Mm, are magnetic fluxes “linked to the loop” 
in positions 1 and 2, i.e. the magnetic fluxes through a surface 
bounded by the loop. In calculating these magnetic fluxes, the 
direction of normal n (23.5.5) is coordinated with the direction 
of the current in the loop by the corkscrew rule: from the head 
of the vector of the normal, the current is seen flowing counter- 
clockwise in the loop. l 
23.6.5 The formulas given above are valid for a current-carrying 
coil of N turns moving in a magnetic field by Ampere forces, or, 
in general, for a loop of arbitrary shape. For example, the work 
done in a small displacement of a coil carrying the current I 
is equal to 


N N 
6bW=I >) dOmi=Id( >) Omi) =1 dY, 
i=1 i=1 
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N 
where ¥ = Be @m; is the total magnetic flux through all N 


i=1 
turns of the coil; it is called the flux linkage of the circuit. 
23.6.6 The work required to move a conductor or closed loop 
carrying a constant current in a magnetic field is done at the 
expense of the energy consumed in the current source (21.1.3). 


CHAPTER 24 MOTION OF CHARGED PARTICLES 
IN ELECTRIC AND MAGNETIC FIELDS 


24.1 Lorentz Force 


24.1.1 Not only current-carrying conductors (23.1.4) are sub- 
ject to the action of a magnetic field. This is also true of separate 
Charge” particles moving in the field. The force FL, acting on 
an electric charge q moving in a magnetic field at the velocity 
v, is called the Loreniz force and is equal to (see also Sect. 24.1.5) 


F; =q {vB] (in SI units), 
Fy =— [vB] (in Gaussian units), 


where q is the algebraic value of the moving charge, B is the 
induction of the magnetic} field in’ which the charge is moving 
(23.1.2), and c is the electrodynamic constant (23.2.2). The 
relative arrangement of vectors v, B andiFr, is shown in 
Fig. 24.4 for positive (q > 0) and negative (q < 0) charges. 
The magnitude of the Lorentz force is 


FL = quB sin Q, 


where @ is the angle between vectors v and B. 

24.1.2 The Lorentz force is always perpendicular to the velocity 
of the charged particle and imparts normal acceleration (1.4.6) 
to it. Since the Lorentz force does not change the magnitude of 
the velocity, but only its direction, this force does no work and 
the kinetic energy of the charged particle remains constant 
when it moves in a magnetic field. 
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24.1.3 Making use of the Lorentz force, magnetic induction B 
(23.1.2) can be defined as follows: the magnitude of the magnet- 
ic induction vector at a given point of the magnetic field is 
equal to the maximum Lorentz force FL max acting on a unit 


Ww, 
VU 


B 
g>0 F, 
Fig. 24.1 


P n charge, which, at the given point, travels at unit ve- 
ocity. Thus 


EF 
B= L max ; 
qv 


Iiere FL = Fy max under the condition that a = m/2 (24.1.2). 


See also Sects. 23.1.2 and 23.4.2. 

24.1.4 The force exerted on moving charge g, by moving charge 
qı is called the-force of their magnetic interaction (magnetic force). 
For the special case of two positive charges q, and gə, travelling 
in vacuum at the same velocity v, = va = v & c along the OX 
axis, the force Fint of magnetic interaction is a force of attrac- 
tion and is numerically equal to 


a 


Fee Fo Li v? (in SI units), 


where r is the distance between the charges, and uo is the magnet- 
ic constant (23.2.2). 
The force Fint of magnetic interaction can be presented in the 
form 

bo 


Fint =W A qB, 
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where 
— Boli? 
a Amr? ° 


In this form the force Fint coincides with the Lorentz force at 
sin @=1 (24.1.1) if we assume that B is the induction of the mag- 
netic field (23.1.2) set up by moving charge qı. In its turn, this 
field acts on moving charge q». Magnetic forces and a magnetic 
field are the result of the way in which forces are transformed 
in going over from a fixed to a moving frame of reference, and 
follows from the formulas of the special theory of relativity 
(5.1.1). 

Comparing force Fint with the force F, of electrostatic repul- 
sion between mutually fixed charges q; and q; in vacuum (e, = 1) 
located at the same distance r from each other (14.2.6) 


mae 192 7: i 
| = ine, r (in SI units), 


woes € is the electric constant in SI units), we readily find 
that 


Fint _ v? 
Fe o c2 j 


where c is the electrodynamic constant (23.2.2) and is related 
to € and uo by the equation E€ọuoọ = 1/c? (Appendix I). When 
the velocity of the charges is small compared to the velocity of 
light in free space (v<«& c), the magnetic interaction between 
the moving charges is considerably less than their electrostatic 
interaction. But in the case when the charges are moving in an 
electrically neutral conductor, the electric forces are compen- 
sated (16.4.2), leaving only magnetic interaction. This explains 
the magnetic interaction of current-carrying conductors (23.4.1). 
Though the force of magnetic interaction between each pair 
of electrons is small, the number of pairs is so immense that the 
resultant force of magnetic interaction of parallel current- 
carrying conductors is an appreciable amount (23.4.1). 

24.1.5 If, in addition to a magnetic field of induction B, a 
moving electric charge is also subject to the action of an electric 
field of strength E (15.1.2), the resultant force F applied to the 
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charge is equal to the vector sum of the force F, = qE, exerted 
on the charge by the electric field and the Lorentz force (24.1.1), 


F = qE -+q [vB] (in SI units), 


F =qE += [vB] (in Gaussian units). 


The iast expression is also called the Lorentz force and, 
sometimes, the generalized Lorentz force or the Lorentz formula. 
24.1.6 By the action of the Lorentz force (24.1.1), a charged 
particle travels in a circular path of constant radius r in a uni- 
form magnetic field perpendicular to the particle’s velocity. 
This circular path is in a plane perpendicular to vector B, 
and the Lorentz force is a centripetal one (2.4.3). The radius 
of the circular path is 


m V 
r= ——-— (in SI units), 
lal BS 


cmv 


ig] B 


r == 


(in Gaussian units), 


where | q | is the absolute value of the particle’s charge, m is 
the mass of the particle, v is its velocity, B is the magnetic field 
induction and c is the electrodynamic constant (23.2.2). If 
the particle travels in the plane of the drawing (Fig. 24.2), its 
deflection in a field perpendicular to the velocity and directed 
from behind the drawing depends upon the sign of the charge. 
This is the basis for determining the sign of the charge of a 
particle travelling in a magnetic field. 

The period 7 of revolution (1.5.5) of a charged particle in a 
uniform magnetic field (23.1.3) is independent of its velocity 
(when v< e): 


2 m ; . 

T = -5 Tal (in SI units), 

T = An ie (in Gaussian units) 
B |q] ` 
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The principle of cyclic orbit accelerators of charged particles 
(24.4.6) is based on the aforesaid. 

24.1.7. If the velocity vector v of a charged particle makes 
the angle œ with the direction of vector B of a uniform magnetic 


Fig. 24.2 Fig. 24.3 


field, the particle travels along a helix (Fig. 24.3) of radius r 
and pitch h equal to 


m vsing 2% m 
r= —- —— , h=>— — —— vcosa (in SI units), 
ial B B Tal i l 
pe v sin g aoe vcos @ (in Gaussian units) 
ll B ’ B ql l 


If this motion occurs in a nonuniform magnetic field (23.1.3) 
whose magnetic induction increases in the direction of motion 
of the particle, then r and h decrease with an increase in B. 
This phenomenon is used for focusing charged particles in a 
magnetic field. 


24.2 Hall Effect 


24.2.1 The Hall efect refers to the setting up of a transverse 
electric field and potential difference in a current-carrying metal 
or semiconductor placed in a magnetic field which is perpen- 
dicular to the direction of the current density vector (20.2.3). 
The Hall effect results from the deflection of electrons moving 
in a magnetic field by the action of the Lorentz force (24.1.1). 
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Shown in Fig. 24.4a are the directions of the magnetic induction 
B, current density j, electron velocity v, Lorentz force FL and 
the signs of the charges accumulated on the opposite, upper and 
lower, sides in the case of a metal or electron (n-type) semicon- 
ductor (41.10.2). In a hole (p-type) semiconductor (41.10.3) 
the signs of the charges on the surfaces are the opposite of those 


Fig. 24.4 


in the preceding case (Fig. 24.40). The charges are deflected by 
the magnetic field until the action of the force in the transverse 
electric field tounterbalances the Lorentz force. __ 

24.2.2 The equilibrium potential difference in the Hall effect is 


IB 
AV = V, — V, = Ry a 3 


where J is the current, B is the magnetic field induction (23.1.2), 
d is the linear dimension of the metal or semiconductor in the 
direction of vector B, and Ru is the Hall coefficient. 

The strength Eu of the transverse clectric field in the Hall 


effect is 
Eu = Ruy [Bi], 


where j is the current density vector. 
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24.2.3 For metals and extrinsic semiconductors (41.10.5), 
having a single kind of conduction, the Hall coefficient equals 


A 
Ry= — (in SI units), 
H= Ta ( ) 


Ry= in Gaussian units 
H nod ( ); 


where c is the electrodynamic constant (23.2.2), q and ny are 
the charge and concentration of the current carriers, and A ~ 1 
is a dimensionless factor depending upon how the current- 
carrier velocities are statistically distributed. The sign of the 
Hall coefficient coincides with that of the current-carrier charge 
q. A measurement of the Hall coefficient of a semiconductor 
indicates its type of electrical conduction. For an n-type semi- 
conductor (electron conduction) (41.10.2), q = — eand Ru < 0; 
for a p-type semiconductor (hole conduction) (41.10.3), q = e 
and Ru >Q. 

If both types of electrical conduction are observed in a semi- 
conductor, the predominant conduction can be determined from 
the sign of the Hall coefficient. The formula for Ru given in 
this subsection is inapplicable in such cases. 

24.2.4 Provided the type of electrical conduction is known, 
a measurement of the Hall coefficient enables the concentra- 
tion no of the current carriers to be determined. For example, 
the concentration of conduction electrons equals the concen- 
tration of atoms in monovalent metals. This means that there 
is one free electron per atom in the electron gas (20.3.1) of the 
metal. A known concentration of current carriers enables the 
mean free path (A) of the electron in the metal to be assessed, 
From the formula of (20.3.4), 


_ 2xm (u) 
a noe? 


9 


we find that (1) ~ 10-8 m, which exceeds the lattice constants 
of the metal by two orders of magnitude (see also Sects. 20.3.7 
and 44.5.5). 


~I 
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24.3 Charge-to-Mass Ratio of Particles. Mass 
Spectroscopy 


24.3.1. The charge-to-mass ratio of a particle, q/m, is a character- 
istic of charged particles. Experimental determination of this 
charge-to-mass ratio is based on measuring the deflection of the 
particles in electric and magnetic fields acting jointly on them. 
The mass of the particles is determined from the charge-to-mass 
ratio q/m and the known charge g. 

24.3.2 The mass spectrum of particles is the set of their mass 
values. Appiying mass spectrometers and mass spectrographs, 
special instruments ed in 

mass spectrometry. the rela- 
tive concentrations of iso- 
topes of chemical elements 
(42.1.3) and their masses can 
be measured with exceptio- 
nally high accuracy. 
24.3.3 In the Aston mass 
spectrograph (Fig. 24.5). 
which was employed to dis- Fije. 94.5 

cover the isotopes, of various ° 

chemical elements, the par- 

ticles are deflected in the uniform electric field of capacitor 
C and the magnetic field of coil M, the fields being perpen- 
dicular to each other. The lower the velocity of the particles 
and the greater their charge-to-mass ratio, the more strongly 
they are deflected toward the capacitor plates. In a uniform 
magnetic field with the induction B directed into the drawing, 
the particles travel along circular arcs (24.1.6). The higher the 
velocity of the particles and the lower their charge-to-mass ratio, 
the greater the radii of these arcs. In a magnetic field a beam 
of particles is split into several beams, each corresponding to 
a definite charge-to-mass ratio. The magnetic field focuses par- 
ticles having different velocities, but the same charge-to-mass 
ratio, on a single point. The magnetic induction B is chosen so 
that the charged particles are focused on a photographic plate 
AD, which is perpendicular to the plane of the drawing. A 
series of narrow parallel lines obtained on the plate corresponds 
to the various values of the particle charge-to-mass ratio. In 
Fig. 24.5, line €, corresponds to particles with a higher charge- 
to-mass ratio, and line €; to particles with a lower one. If 
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the charge-to-mass ratio of the particles of line €, is known, as 
well as the distance between lines €; and €, and the parameters 
of the instrument, the charge-to-mass ratio of the particles 
corresponding to line G, can be determined. 

If the parallel-plate capacitors are replaced by tubular ones 
and special electromagnets are used that produce a plane-paral- 
lel beam of particles entering the magnetic field, we can achieve 
double focusing of the particles with respect to both energy and 
direction. With sufficient intensity of the lines on the photo- 
graphic plate, this technique ensures high precision in particle 
mass Measurements. For instance, the masses of the ions of 
light-weight elements can be measured to an accuracy within 
10-4 per cent. 

24.3.4 The relative, concentrations of the isotopes of chemical 
elements in their natural mixtures are measured by instruments 
called mass spectrometers that electrically record the ion currents. 
Used in mass spectrometers are ion beams in which the kinetic 
energies of the ions are close in value. These are called mono- 
chromatic (monoenergetic) beamsjand are produced by ion 
sources of special design. Even when they are strongly divergent 
and contain a great number of ions, such beams focus well in 
a transverse magnetic field. This raises the precision with which 
the concentrations of various isotopes can be measured. 


24.4 Charged Particle Accelerators 


24.4.1 Special installations for obtaining directed beams of 
charged particles (electrons, protons, atomic nuclei and ions 
of chemical elements) having an extremely high kinetic energy, 
under laboratory conditions are called accelerators. According 
to the shape of the path of the particles and the mechanism for 
accelerating them, distinction is made between linear, cyclic 
and induction accelerators. In linear accelerators (also called lin- 
acs) the paths of motion of the particles are close to straight 
lines; in the cyclic and induction accelerators the paths of the 
particles are circles or unwinding spirals. 

24.4.2 The energy of the accelerated particles is increased 
by the electric field set up in the accelerator. Depending on the 
type of accelerator, this field can be electrostatic (15.1.2), 
induced (25.1.2) or high-frequency alternating. The principle 
of the betatron, based on the action of an induced electric field, 
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is discussed in Sect. 27.2.3. In a linear electrostatic accelerator, 
a charged particle passes only once through an accelerating 
electric field with a potential difference (Vı — Və). If q is the 
charge of the particle, the energy acquired by the particle in 
the accelerator (15.2.7) is equal to 


E = q (Vi — Vo). 


In such an accelerator the electric field is set up, for example, 
by a Van de Graaff high-voltage electrostatic generator, operat- 
ing by the principle of multiple transfer of charges to a hollow 
conductor (16.4.3). This raises 
the potential of the con- 
ductor to values limited only 
by the leakage of the charge 
from the conductor. 

24.4.3 In linear resonant ac- 
celerators, the energy of the 
charged particles is increased 
by an alternating electric 
field of super-high frequency, 
which variesin synchronism 
with the motion of the par- 
ticles. Using this type of 
accelerator, electrons can be 
accelerated to an energy of 
the order of dozens of GeV 
over a path length of sev- lig. 24.6 

eral kilometres. 

24.4.4 Protons, deuterons and other, heavier, charged par- 
ticles are accelerated by resonant cyclic accelerators, in which a 
particle repeatedly passes through an electric field synchronized 
with its motion, increasing its energy each time. A strong trans- 
verse magnetic field controls the motion of the particles and 
returns them periodically to the region of the accelerating elec- 
tric field. The particles pass definite points of the alternating 
electric field approximately when the field is in the same phase 
(“in resonance”). 

24.4.5 In the cyclotron, the simplest resonant cyclic accelerator, 
an accelerating alternating electric field is set up in the gap 
between the two halves (dees) N and M (Fig. 24.6) of a cylindri- 
cal box. The dees are placed in a flat closed chamber located be- 
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tween the poles of a strong electromagnet, whose magnetic 
field is perpendicular to the plane of the drawing. The alternat- 
ing electric field in the gap between the dees is set up by an 
senso generator whose terminals are connected to electrodes 
m and n. 

24.4.6 A particle is accelerated in the gap between the dees M 
and N each time it crosses this gap after describing semicircles 
of continuously increasing radii due to the action of the magnet- 
ic field. The same time is required for each semicircle (24.1.6). 
To continuously accelerate the particles it is necessary to com- 
ply with the condition of synchronism (condition of “resonance”): 
T = To, where T is the period of revolution of the particle in 
the magnetic field (24.1.6), and Tọ is the period of oscillation 
of the magnetic field (28.1.2). This condition is violated at rel- 
ativistic velocities v of the particle, commensurate with the 
velocity of light c in free space. At such high velocities, the 
mass m of the particle increases with the velocity (5.6.1) as 
does the period T (24.1.6). 

24.4.7 The feasibility of accelerating charged particles. trav- 
elling at relativistic velocities in cyclic accelerators follows 
from the principle of phase stability (autophasing), which states 
that each deviation of the period 7 from the resonance value 
To (24.4.6) leads to such a change in the energy E of the particle 
at each acceleration that T varies in the neighborhood of To 
and, on an average, remains equal to Tọ. Thus 


2m m 2n E 


Pa (eee ea 


B i] B qe 


where E = me (5.7.2), m = mo/ V1 — v/e2 (5.6.1), mp is the 
rest mass (5.6.1) of the particle, and c is the velocity of light 
in free space; the remaining notation is the same as in Sect. 
24.1.6. i 
If, for example, as a result of the increase in mass m and the 
eriod T, the particle enters the gap between the dees (Fig. 24.6) 
by the action of the decelerating instead of accelerating electric 
field, the reduction in particle velocity leads to the reduction 
in 7 and the equality T = Tọ is reached again. 
24.4.8 It follows from the principle of phase stability (24.4.7) 
that at a sufficiently slow increase in the period 7’, of the elec- 
tric field, the period 7 of revolution of the particle in the mag- 
netic field of the accelerator increases correspondingly. This 
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leads to an increase in the average energy (E) of all the particles 
because at constant induction of the magnetic field any increase 
in T is possible only owing to the increase in mass as the veloc- 
ity of the particles increases. 
This principle has been applied in an accelerator called the 
synchrocyclotron (also frequency-modulated cyclotron and phaso- 
Iiron). This device has a constant magnetic field, whereas the 
frequency vy = 1/To (28.1.2) of the alternating electric field 
varies slowly with the period t To. The radius of the orbit 
of the particles in the synchrocyclotron increases with their 
velocity (24.1.6). This requires a corresponding increase in the 
size of the synchrocyclotron to reach the maximum energy of 
the charged particles. The synchrocyclotron built and operated 
in the USSR, accelerating protons to an energy of 680 MeV, 
has a magnet with poles 6 m in diameter and a mass of 7 X 108 
inetric tons. 
24.4.9 The synchrotron is an accelerator in which the frequency 
of the accelerating electric field is constant, and the induction 
B of the magnetic field varies with time. The period of revolu- 
tion of a particle in the magnetic field of a synchrotron (24.4.6) 
is 

_ 2m EK 

jefe? B’ 


where e is the charge of the electron, and £ is its energy. The 
condition of synchronism (24.4.6) is complied with in a syn- 
chrotron at Tọ = const, provided that the induction of the 
eens field increases proportionally to the particle’s energy. 
‘Thus 

| an E 

D= Tele Ty’ 

uae To is the period of a high-frequency accelerating electric 
field. DA 


The following condition is complied with in a synchrotron: 
m ely a 


B On = Const. 


The particles travel along orbits close to circular ones: (24.4.6) 
and, consequently, a synchrotron is equipped with annular 
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magnets that set up a magnetic field in a comparatively narrow 
region of the circular orbit. 

24.4.10 A proton synchrotron, the most powerful accelerator of 
protons, combines the principles applied in the synchrocyclo- 
tron (24.4.8) and in the synchrotron (24.4.9). In this accelera- 
tor, the frequency v, of the accelerating electric field is reduced 
and the induction B of the magnetic field is increased simulta- 
neously and in coordination. The protons being accelerated 
travel along a circular orbit of constant radius. Therefore, the 
magnetic field is set up by an annular electromagnet as in the 
synchrotron. 

24.4.11 The condition for simultaneous maintenance of vertical 
(axial) and radial stability of the design circular orbit in the 
synchrotron and proton synchrotron is the variation of magnet- 
ic induction B near the design orbit according to the law 


const 
rn’ 


B= 


where r is the distance from the centre of the orbit, and n varies 
within the limits 0 < n < 1 (condition of weak focusing in the 
accelerator). With an increase in the maximum energy Emax 
acquired by the particles in an accelerator having weak focus- 
ing, the required mass of the electromagnet increases ap- 


proximately in proportion to Emax. 

24.4.12 Accelerators having strong focusing are resorted to for 
increasing the maximum energy Fmax of particles accelerated 
in synchrotrons and proton synchrotrons. In these accelerators, 
two kinds of sector-shaped magnets are used alternately along 
the almost circular orbit of particle travel. In one kind of sector 
the magnetic field varies according to the law of Sect. 24.4.11, 
in which n is much less than zero (for example, n = — 100), 
and in the other kind n> 1. Sectors of the first type provide 
for radial focusing of the beam of particles being accelerated; 
sectors of the second kind, for vertical focusing. The applica- 
tion of strong focusing enables substantial reductions to be ob- 
tained in the overall size of the accelerator, the mass of the 
electromagnet and the total cost of the installation. This method 
is also called alternating-gradient focusing. 

24.4.13 To increase the share of energy utilized by accelerated 
particles for various nuclear reactions (42.9.1), the bombard- 
ment of stationary targets by high-energy particles is replaced 
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by the colliding beam technique. It follows from the laws of 
conservation of energy (3.4.3) and momentum (2.7.1) that in 
bombarding a stationary target, the share of the kinetic energy 
ly of the incident particle, utilized in the nuclear reaction, 
decreases as E, is increased. When the technique of colliding 
beams is used, the total momentum of the particles after col- 
lision is reduced and the share of their useful energy is increased. 
Assume, for example, that in an accelerator with colliding beams 
of protons the energy of the protons in each beam is equal to 
26 GeV. The total momentum of two protons colliding at equal 
and opposing velocities equals zero. But the collision energy 
of such two particles reaches 50 GeV. To obtain the same colli- 
sion energy in bombarding a stationary hydrogen target, a beam 
of protons with energy of the order of 1400 GeV is required. 


CHAPTER 25 ELECTROMAGNETIC INDUCTION * 


25.1 Basic Law of, Electromagnetic Induction 


25.1.1 The phenomenon of electromagnetic induction is the sett- 
ing up of an induced electric field in a conducting loop, or circuit, 
located in a varying magnetic field. The energy measure of this 
induced field is the electromotive force 64 of electromagnetic in- 
duction. If the loop is closed, the induced electric field causes 
ordered motion of electrons in the loop, i.e. a current, called the 
induced current, is produced. | 
25.1.2 Faraday's law of electromagnetic induction states that the 
emf é; of electromagnetic induction in a loop, or circuit, is 
proportional to the rate of change of the magnetic flux Dp» 
(23.5.5) through the surface area bounded by this loop. Thus 


co dm 

E= — Ji (in SI units), 

vo. í dm ; i 

61 = er (in Gaussian units). 


* In all the formulas written in Gaussian units, c is the electro- 
dynamic constant (23.2.2). 
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The cause of the change in the magnetic flux is unessential. 
It may be due to distortion of the loop or its motion in the ex- 
ternal magnetic field or to any other reasons for the change 
in the magnetic field with time. When a closed conducting 
circuit (or loop) moves in a magnetic field, emf’s 4, are induced 
in all of the parts of the loop that cut the lines of magnetic in- 
duction. The algebraic sum of these emf’s (25.1.3) is equal to 
the total emf of the loop. The work that must be done to move 
a closed loop, or circuit, in a magnetic field is equal to the work 
of the current induced in the loop. 

In calculating ©,, and 6; the directions around the loop (21.3.2) 
and of the outward normal n (23.5.5 and 23.5.8) are to be coordi- 
nated so that from the head of vector n the direction around 
the circuit (or loop) is seen to be counterclockwise. If a closed 
circuit consists of N turns connected in series (for example, 
the solenoid in Sect. 23.3.7), then the magnetic flux Ọm in 
Faraday’s law is to be replaced by the flux, or magnetic, link- 
age W of the circuit (23.6.5): 


av : 
é=— J; (in SI units), 


1 d¥ : l 
C ae (in Gaussian units). 


25.1.3 The emf of electromagnetic induction in a loop (or cir- 
cuit) is said to be positive if the magnetic moment vector Pm 
of the corresponding induction current (23.3.4) makes an acute 
angle with the lines of magnetic induction of the field that in- 
duces this current. Otherwise, @; is said to be negative. In 
Fig. 25.4a, the emf 6; < 0, whereas in Fig. 25.1b, €; > 0. 
25.1.4 The minus sign in the formula for é; complies with the 
Lenz law, which states that upon any change in the magnetic 
flux through a surface bounded by a closed loop, the current 
«induced in the loop is always of a direction in which the magnet- 
ic field set up by the induced current opposes the change in the 
magnetic flux that induced the current. 

25.1.5 An emf of electromagnetic induction is developed in 
a part of a conductor which, in its motion, cuts lines of induc- 
tion of a magnetic field (23.1.3). In the case illustrated'in Fig. 25.2, 
the conduction electrons of the metal (20.3.1) are subject to the 
Lorentz force (24.1.1). Thus FL = — e [(v -+ v’) B], where v 
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is the velocity of the part AC of the conductor in the magnetic 
field, whose induction vector B is perpendicular to the plane 
passing through the part of the conductor and the velocity 


e tang (9) Hines 


6) <0 
Fig. 25.1 


vector of its motion. The electrons travel orderly along the 
conductor at the velocity v’ by the action of the component of 
the Lorentz force tangent to the conductor in the direction from 


Fig. 25.2 


A to C. The motion of the electrons ceases when the electro- 
static field set up in the conductor and acting on the electrons 
with the force eE counterbalances the Lorentz force, 
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According to Ohm’s law for an open circuit (at J = 0) (21.2.5), 
the equilibrium potential difference AV = V4, — Vc between 
points A and C that is established at v’ = 0 is equal to 


AVE Vy SV = = Si 


where €; is the electromotive force of induction, because there 
are no sources of electrical energy in part AC of the conductor. 
The emf of electromagnetic induction in a piece of conductor 
of length 1, moving at the velocity v, is 


6i = — Bul, 
or 
(Dp 
= — dm (in SI units), 


dt 


where d@,,/di is the ratio of the magnetic flux through the 
surface generated by the conductor in its motion during an in- 
finitesimal length of time to the value dt of this length of time 
or, in other words, the velocity at which the conductor cuts 
the lines of induction of the magnetic field (cf. Sect. 25.1.2). 
25.1.6 The phenomenon of electromagnetic induction in sta- 
tionary closed conducting circuits, located in an external varying 
magnetic field, cannot be explained by means of the Lorentz 
force, which has no effect on fixed charges (24.1.1). 

The phenomenon of electromagnetic induction in stationary 
conductors is due to the fact that the varying magnetic field 
excites a rotational electric field. The circulation of the strength 
vector of this field along the closed path Z of the conductor 
(16.1.4) is the emf of electromagnetic induction. Thus 


D , 
E= ) E dl= — st (in SI units), 
L 
D 
6i= D Edl= T Om (in Gaussian units), 


where the partial derivative 0@,,/dt takes into account the 
dependence of the magnetic induction flux only on time. The 
choice of the direction of normal n in calculating the magnetic 
flux is discussed in Sect. 25.1.2, | . 


25.2] PHENOMENON OF SELF-INDUCTION 397 


25.1.7 The magnitude q of the electric charge passing through 
the cross section of the wire of the turn in which the current is 
induced is 


(Pp! — D” 
q = -= (in SI units), 
R 
D’ ace Ud 
q= = 2 T ™. (in Gaussian units), 


where Dm and Mm are the values of the magnetic flux through 
the area of the turn in its initial and final positions, and R is 
the electrical resistance of the turn. 


25.2 Phenomenon of Self-Induction 


25.2.1 Self-induciion is the production of an induced electric 
field in a circuit as a result of the variations of current in the 
same circuit. The energy characteristic of the induced field is 
the emf of self-induction €s. 

25.2.2 The intrinsic magnetic field of the current in the circuit 
produces the magnetic flux Dms through the surface S bounded 
by the current-carrying loop. Thus 


S 
where Bn is the projection of the induction vector B of the mag- 
netic field (23.1.2) on the normal n to the element of surface dS. 
The quantity Pms is called the magnetic flux of self-induction of 
the loop. Making use of the Biot-Savart-Laplace law (23.2.2), 


ms can be calculated for the case in which the loop is in a 
nonferromagnetic medium (26.5.2): 


_ r Lo h Br 
Pns = Fa \ ds $ -3 [dl r]n (in SI units), 
S L 


Pms => I | dS $ a [dlr], (in Gaussian units), 
S L 
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or 
Oms= LI (in SI units), 


Oms = Si (in Gaussian units), 


where L is called the self-inductance of the loop and is deter- 
mined by the formulas 


=. Pr dirja ( 
= 4n \ as È 73 [dl r]n (in SI units), 
S L 


L= \ dS $ Hr [dlr], (in Gaussian units). 
S L 


Here uo is the magnetic constant in SI units (23.2.2), p, is the 
relative magnetic permeability of the medium, r is the radius 
vector from element di of the loop to element dS of surface S, 
bounded by the loop, and subscript n indicates the projection 
on the normal to the element dS. 

25.2.3 The self-inductance L of the loop is numerically equal 
to the self-induction flux at a current of J = 1 A (in SI units) 
or I = c (in Gaussian units). Self-inductance Z depends only 
upon the geometric shape and dimensions of the loop, as well 
as on the magnetic properties of the medium it is located in. 
The self-inductance of a sufficiently long solenoid (23.4.8) is 


2 
L= Bohr L popen? Y (in SI units), 
„N? 
L = ENS App n2V (in Gaussian units). 


where Z and S are the length and cross-sectional area of the so- 
lenoid, N is the total number of turns, n = N/L is the number 
of turns per unit length, and V = SI is the volume of the so- 
lenoid. 
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25.2.4 The expression for the emf of self-induction follows from 
Faraday’s law (25.1.2): 


ms d | | 
Es = — 7 =e, AE) (in SI units), 

C Sa 1 dDms ray 4 d . ie e 
és=— po n (LI) (in Gaussian units). 


For a rigid (undeformable) loop in a nonferromagnetic medium 
(26.5.2) L = const. Hence 


Es = -L2 (in SI units), 


L L dI i 10) j 
E (in Gaussian units). 


25.2.5 The electromotive force of self-induction causes a 
self-induction current to flow in the loop. According to the Lenz 
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law (25.1.4), this current counteracts the change in the main 
current in the circuit, impeding its increase or decrease. The 
self-inductance of the loop (25.2.3) is a measure of its inertia 
to changes of the current. 

25.2.6 The law for the change of the current in closing and open- 
ing a circuit, i.e. with a nonsteady state in the circuit, is 


I= pe E if (=en RUE), 
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where J, is the current at the initial instant of time (t = 0), 
R is the electrical resistance of the circuit, L is its self-induc- 
tance, and & is the algebraic sum of the emf’s of the sources of 
electrical energy connected into the circuit (21.2.2). 

When the source of emf is switched in there is no initial current 
in the circuit (I) = 0): 


a TOL fc eR 
[= — (1 e ). 


The current in the circuit increases from zero to the value €/R, 
corresponding to that of the constant current. The higher the 


Ow 
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Fig. 25.5 


ratio R/L (Fig. 25.3), the more rapidly the current increases. 
When the source of emf is switched off, 6 = 0: 


I = Ige-Rt/L. 


The current decreases exponentially from the initial value To. 
The higher the ratio R/L, the more rapidly the current decreases 
(Fig. 25.4). 


25.3 Mutual Induction 


25.3.1 An emf is induced in all conductors in the vicinity of a 
circuit in which the current varies. This phenomenon is called 
mutual induction. Upon a change in the current J, in circuit 1 
(Fig. 25.5), for example, an emf €,; of mutual induction is in- 
duced in circuit 2. It equals 


D ; ; 
fym en (in SI units), 


dm: : ; i 
Ey = — 2 ema (in Gaussian units), 
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where mə: is the magnetic flux through the surface bounded 
by the second circuit. This flux is due to the magnetic field 
set up by the current in the first circuit (magnetic flux of mutual 
induction). 

25.3.2 By analogy with Sect. 25.2.2, the magnetic flux ®mə1 
is proportional to the current J,. Thus 


Ome1= Mali (in SI units), 
Omo1 = t Məl; (in Gaussian units), 


where M,, is called the mutual inductance of the second and 
first circuits. The quantity Mı depends upon the shape, di- 
mensions and relative arrangement of the circuits, as well as 
the relative magnetic permeability of the medium surround- 
ing them. The magnetic flux m12 through the surface bounded 
by the first circuit is due to the magnetic field set up by current 
I, in the second circuit. Thus 


Dmi2= Miola (in SI units), 


1 : , i 
Pm = > M,,I, (in Gaussian units), 


where Mı is the mutual inductance of the first and second 
circuits. 

If the circuits are in a nonferromagnetic medium (26.5.2), then 
Me, = Mia. For ferromagnetic media M>, and My, are not 
equal and depend, in addition to the above-mentioned factors, 
on the currents in the two circuits and the way in which they 
vary. | 

25.3.3 The expressions for the emf of mutual induction, under 
the condition that Mı and M,, are both constant values, are 


d : 
60, = —May G (in SI units), 


dt 
a 1 so : l 
6a = — -z Ma T (in Gaussian units), 
d : : 
6j = My S (in SI units), 
1 ls: o n ; ; 
b= — ro Mi- (in Gaussian units). 
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An example of the application of mutual induction is the prin- 
ciple of the transformer which serves to raise or lower the volt- 
age of alternating current. 


25.4 Energy of a Magnetic Field Set up by an 
Electric Current * 


25.4.1 To produce an electric current in a closed circuit (or loop) 
and to increase this current from zero to J it is necessary to do 
work W to overcome the emf of self-induction that impedes 
the increase in current (25.2.6). Thus 

Dms? LI? 


W= 5 9 (in SI units), 


_1 Ops! 1 LE : : 
wa ge at age (in Gaussian units), 


where ms is the magnetic flux of self-induction of the loop 
(25.2.2) and L is the self-inductance of the loop (25.2.2). 
According to the energy conservation law, W determines the 
intrinsic energy Em of current I in a circuit. Thus 


2 
Em = (in SI units), 

1 LI? |. . 
En= a oe (in Gaussian units). 


25.4.2 Increasing together with the current in a circuit is the 
magnetic field set up by this current. Hence, the intrinsic energy 
of a current is dealt with as energy of this magnetic field. In 
a long solenoid (23.3.8), for instance, having a uniform magnet- 
ic field (23.1.2), | 


E m= S Botra? PV (in SI units), 
Eam Er 2nn® PV (in Gaussian units), 


* It is assumed in this section that the conductors carrying 
currents are in a nonferromagnetic, homogeneous anil iso- 
tropic medium, 
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where V is the volume of the solenoid, n is the number of turns 
per unit length, uo is the magnetic constant (23.2.2), and u, is 
the relative permeability of the medium. 


25.4.3 The volume energy density Em of a magnetic field is the 
energy per unit volume of the field. Thus 


d —_ dEm 
m= ay 


For a uniform magnetic field (23.1.3), Em =: Ew V. 
The volume energy density of any, including a nonuniform 
magnetic field, is expressed by the formula 


2 

E% = > BH => prH? = 5 2 (in SI units), 
vA _ BH _uB?__ 1 OB . : 

E n” ar e me (in Gaussian units). 


Here B and H are*the magnitudes of the vectors of magnetic 
induction (23.1.2) and strength (23.2.3) at the point being 
considered in the magnetic field. The remaining notation is 
defined in Sect. 25.4.2. 

25.4.4 The energy Em of a magnetic field, localized in the 
volume V is 


En \ 2H av (in SI units), 
V 


Em = | ao dV (in Gaussian units). 
4 


25.4.5 If the magnetic field is set up by an arbitrary system of 
n circuits, or loops, with the currents J,, I, I3, ..., In, the 
energy of the field can be expressed by the formula of Sect. 
25.4.4, in which B and H are understood to be the magnitudes 
of vectors B and H of the resultant magnetic field, in accordance 


SEYA ELECTROMAGNETIC INDUCTION [25 


with the principle of superposition of fields (15.2.2). Moreover. 
the energy of the magnetic field in this case is 


n 
D kI 
En= > a ae 
k=1 


where mpg is the total magnetic flux linked to the kth loop 
(23.6.5). In calculating this flux, the normal n, to the surface 
bounded by the loop is such that from the head of vector n, 
the current is seen to be counterclockwise. The magnetic flux 
mk equals 


Omk =e Dmsk F Dmmk, 


where Ọmsk is the magnetic flux of self-induction of the kth 
loop (25.2.2) and Mmm, is the magnetic flux of its mutual in- 
duction, produced by all the other loops carrying current 
(25.3.1). Hence the energy Em of the magnetic field is equal to 


n Ly I? i n n 
Baa = +5 >) D, Mulal (in SI units), 
k=1 k=1 I=1 (1R) 


4 n LpI? 1 n n 
Em= ro p aero ` D Mailrpl (in Gaussian 
k=i h=1 l=1 (lh) units). 


The first term is the sum of the intrinsic energies of all the cur- 
rents (25.4.1). The second term is called the mutual energy 
of the currents. Here Mp; is the mutual inductance of the 
say and lth loops, or circuits, (25.3.2) with the currents J, 
and J). | 
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CHAPTER 26 MAGNETIC MATERIALS 
IN A MAGNETIC FIELD 


26.1 Magnetic Moments of Electrons and Atoms 


26.1.1 Magnetic materials are all substances capable of acquir- 
ing magnetic properties, i.e. of being magnetized, in an external 
magnetic field. This means that they can set up their own mag- 
netic field. The magnetic properties of substances are determined 
by the magnetic properties of their electrons and atoms.* 
Magnetic materials are divided according to their magnetic 
properties into three main groups: diamagnetic A para- 
magnetic (26.3.5) and ferromagnetic (26.5.1) materials. 
26.1.2 The motion of an electron along its orbit in an atom is 
equivalent to a certain closed loop carrying a current (orbital 
current). According to Sect. 23.3.4, the orbital magnetic moment 
Pm of an electron equals 


Pm = JSn (in SI units), 
Pm = = ISn (in Gaussian units), 


where J = ev is the current, e is the magnitude of the charge of 
the electron, v is the number of revolutions of the electron along 
its orbit in unit time, S is the area of the electron orbit, n is 
a unit vector of the normal to area S, and c is the electrodynamic 
constant (23.2.2). 

An electron moving along its orbit has an orbital angular mo- 
mentum L, (4.1.4). The orbital magnetic moment is proportion- 
al to the orbital angular momentum: 


Pm = YLe, 

where v=- (in SI units), 
e g . E 

eene (in Gaussian units). 


* The magnetic properties of atomic nuclei are dealt with in 
Sect. 42.1.6. 
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ee TOU y is called the gyromagnetic ratio of orbital mo- 
ments. 

In the last formulas m is the mass of the electron, e is the magni- 
tude of its charge, and c is the electrodynamic constant (23.2.2). 
Vectors pm and L, are opposite in direc- 
tion and are perpendicular to the or- 
bital piane of the electron (Fig. 26.1). 
26.1.3 An electron has an intrinsic 
angular momentum s, which is called 
the spin of the electron. The magnitude 
of the electron spin is 


Fig. 26.4 


where h is Planck’s constant (Appen- 
dix II), and A = h/2n. The most impor- 
tant feature of electron spin is that it has only two projections 
on the direction of vector B of magnetic induction.* Thus 


h 
sp= ts. 


26.1.4 Electron spin s corresponds to the spin magnetic moment 
Pms which is proportional to the spin and opposite in direction. 
Thus 


Pms = YsS- 


The quantity ys is called the gyromagnetic ratio of spin momenis: 
e ; 
C (in SI units), 
e š : A 
A S US). 
Ys = (in Gaussian units) 
where the notation is the same as in Sect. 26.1.2. 


* Regardless of whether it is an external magnetic field set up, 
for instance, by current-carrying conductors, or the internal 
magnetic field of the substance itself (26.4.2). 
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The projection of the electron spin magnetic moment on the 
direction of the field (26.1.3) equals 


eh : 
Pmsp ~ + Dm = up (in SI units), 


eh 
2mc 


Pins p ~ as =+ upg (in Gaussian units), 


where up is called the Bohr magneton, and is a unit for measur- 
ing magnetic moments (Appendix II). 

26.1.5 The information presented in Sects. 26.1.1 through 
26.1.4 is valid for each of the Z electrons in an atom. The number 
Z coincides with the atomic number in Mendeleev’s periodic 
table (39.3.5). 

The orbital magnetic moment Pm of an atom is the vector sum 
of the orbital magnetic moments p,, of all its electrons. Thus 


Z 
Pm = > Pmi: 


i=l 


The orbital angular momentum L of the atom is the vector sum 
of the orbital angular momenta L, of all Z electrons. Thus 


Z 


lee) Lee 


i=1 


The atomic orbital magnetic moment Pm and angular momentum 
L have the following relationship: 
Pn = YL, 


where y is the gyromagnetic ratio (26.1.2). 


26.2 An Atom in a Magnetic Field 


26.2.4 If a substance is in an external magnetic field, this field 
can be assumed uniform (23.1.3) within the limits of an atom. 
This follows from the smallness of atomic dimensions. Let us 
assume that an electron in the atom travels along a circular 


~2—0186 
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orbit. whose plane: is’ perpendicular to the induction vector B 
of the magnetic field. The action of the Lorentz force FL (24.4.1) 
reduces the force of attraction of the electron to the nucleus. 
The centripetal force (2.4.3) is found. to equal the difference 
Fe — Fu, where Feis the Coulomb force (14.2.2) of attraction 
of an electron to the nucleus 
(Fig. 26.2). This results in a 
change in the angular velocity 
W do of the electron along 
its circular orbit. i , 
26.2.2 The change in the angular 
‘velocity of the. electron occurs 
in. the process of growth of the 
‘magnetic field into which’ the 
atom is placed. This growth 
Fis. 26.2 +. takes place during ʻa finite length 
Bas of. time. “At this an induced 
on rotational electric field (25.1.1) 
is developed that acts on the electron in the atom. The strength 
E of this field is directed along a tangent to the electron’s orbit 
and the force acting on the eoa is F = eE (15.1.2). 
26.2.3 Upon an arbitrary position of the electron’s orbit with 
respect’to vector B, the orbital magnetic moment pm of the 
electron (26.1.2) makes’ an angle œ with the direction of the 
magnetic field (Fig. 26.3). In this case there will be a precession- 
al motion of the orbit about the direction of vector B (4.3.2). 
This means that vector p,,, perpendicular to the orbit and main- 
taining its angle œ with the field constant, rotates about the 
direction of: vector B with- the angular. velocity: ®L. Thus 


eB a 
OL = 5; (in SI units), 

eH Berit wlan Nee 
aa (in Gaussian unitsy. 


Here e is the magnitude of the charge of the electron, m is its 
mass, H is the magnetic field strength, c is the electrodynamic 
constant (23.2:2), and @L is.called the angular velocity of Larmor 
precession. a E i E 

Larmor’s theorem states that the only result of the effect of a 
magnetic field on an electron’s orbit. in-an atom .is.the preces- 
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sion of the orbit and vector pm with the angular velocity ;, 
about an axis through the nucleus of the atom and parallel to 
the induction vector B of the magnetic field. 

26.2.4 The precessional motion of the electron’s orbit results 
in an additional orbital current AIorp (Fig. 26.3) and the cor- 


Precession of the electron 
and of ils orbital magnetic 
moment i 


Additional (precessianal) | 
motion of the electron 


Fig. 26.3 a 


ant ding induced orbital magnetic moment Ap,, (26.4.2) 
with the magnitude ` ' | : - i l ie 


i 


e?S 
APm = Alorps | = Asm B (in SI units), 
4 es, a 
APm = a IorbS | a ET B (in Gaussian Gata) 


where Sis the area of the projection of the orbit on a plane 
perpendicular to the direction of vector B. Vector Ap,, has 


22* 
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the direction opposite to the magnetic induction vector B. 
Thus 


e2S$ 


at Oe! W ; . ; 
A Pm = Tas B (in Si units), 
ied B (in Gaussian units) 
APm = ~ 4nme2 l 


The notation is the same as in Sect. 26.2.3. 
The total induced orbital moment APm of the atom (26.1.5) 
is 


eZ (S |) ; 
APm = — lan B (in SI units), 

eZ (5 |) . : , 
APm = — Bone B (in Gaussian units), 


where Z is the number of electrons in the atom, and (S1) = 
Z 

-( $ So; /Z is the average area of the projections of the 
i=1 


electron orbits on a plane perpendicular to the direction of 
vector B. 


26.3 Diamagnetic and Paramagnetic Materials in a 
Uniform Magnetic Field 


26.3.1 The magnetization of matter is characterized by the 
magnetization vector (magnetization intensity) M, which is the 
ratio of the magnetic moment of a small volume AV of the sub- 
stance to the magnitude of this volume. Thus 


A N 
M = ~y Dy Pmi» 


i=1 


where Pm; is the magnetic moment of the ith atom (or molec: le), 
and N is the total number of atoms (or molecules) in the small 
volume AV. Within the limits of the volume AV the magnetic 
field is assumed uniform. This volume should simultaneously 


26.3) DIAMAGNETIC AND PARAMAGNETIC MATERIALS 341 


contain a sufficiently large number of particles N (i.e. N > í) 
to enable the physical quantities characterizing the system of 
particles to be expediently averaged. 

26.3.2 Diamagnetic materials are substances in which the mag- 
netic moments of the atoms (or molecules) equal zero in the 
absence of an external magnetic field. This means that in dia- 
magnetic materials the vector sum of the orbital magnetic 
moments of all the electrons in an atom equals zero (26.1.5) 
and that induced magnetic moments (26.2.4) exist only in a 
magnetic field. 

26.3.3 In the volume AY of an isotropic diamagnetic material, 
the induced magnetic moments AP,, of all the atoms (or mole- 
era ie the same and are opposite in direction to vector B 
26.2.4). 

The magnetization vector M is equal to 


N APm 
AV 


and, taking Sect. 26.2.4 into account: 
nge?Z (S 
0 ( 1) B 


M = = Ro APm, 


M = — ih (in SI units), 
n e2Z (S 
M = aN EN (in Gaussian units), 
4nmc? 


where ny is the number of atoms (or molecules) in unit volume. 
The other notation is the same as in Sect. 26.2.4. Introducing 
the notation: 


, noeZCS |) Ho , . 
Xm =a ee aan (in SI units), 

, nge?Z (S| ) š E . 
L SER (in Gaussian units), 
we have 


M= tin (in SI units), 


M= y,,B (in Gaussian units), 
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where y,, is a dimensionless quantity that characterizes the mag- 

‘netic properties of magnetic materials. For all diamagnetic 
materials X% < 0 a, 
26.3.4 The magnetic susceptibility X is a. quantity that is re- 
lated to X/, as follows: | 


1+Xm = (in SI units), 
uo ti m 


1 


————— (in Gaussian units), 
1— 40x, ( ) 


from which 
Xm = Xm (in SI units) 
m 1— x, , 


Xm = =————-__ (in Gaussian units). 


Practically, Xm = Xm for diamagnetic materials, because. in 
absolute value y,, is very small: | X | ~ 10-8. 


26.3.5 Paramagnetic materials are substances in which the 
atoms (or molecules) have a certain constant magnetic moment 
Pm in the absence of an external magnetic field. This means 
that the vector sum of the orbital magnetic moments .of all 
‘the: electrons. of an atom (or molecule) is not: equal to zéro 
(26.4.5). | : 
26.3.6 When a paramagnetic material is placed into a.uniform 
magnetic field (23.1.3), the constant magnetic moments of the 
atoms (or molecules) precess about the direction of the induc- 
tion vector B of the magnetic field at Larmor’s angular velocity 
Or, (26.2.3). 

Thermal motion and collisions of the atoms (or molecules) of 
a paramagnetic material lead to a gradual damping of the pre- 
cession of the magnetic moments and to a reduction in the an- 
gles between the directions of the magnetic moment vectors 
and vector B. The combined actions of the interatomic collisions 
and the magnetic field lead to a predominant orientation of the 
magnetic moments of the atoms, in alignment .with: the:external 
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field. Though the constant magnetic moment Pm of an atom 
(or molecule) is of the order of 10-33 joule/tesla (10-2? erg/gauss), 
the magnetic moments of all the particles in unit volume pro- 
duce a magnetization that considerably exceeds diamagnetic 
phenomena (26.3.3). A paramagnetic material in an external 
magnetic field'has an intrinsic magnetic field aligned with the 
external. magnetic field. oe E S 
26.3.7 In the classical theory of paramagnetism the magnitude 
of the magnetization vector (26.3.1) is expressed by the formula 


M = noPoL (a), a o aF 


where no is the number of atoms (or molecules) in unit yolume, 

and Z (a) is the classical Langevin function: — | Hoe 
: a sä $ : a « ý 

L (a) PR a E 


ea — e~a `a > , 

The parameter a is of the form a = P,,B/kT,. where B is the 
magnetic induction. of the field, k is Boltzmann’s constant 
(8.4.5), and T` is.the absolute temperature. At room tempera- 
ture and not especially strong external fields, a< 1 and the 
function LZ (a) is, simplified, after expansion into a series, and 
we obtain ZL (a) ~*a/3. Hence, the magnetization vector, is 
equal to — a | no 


M=¥Xn Ba ‘(in ST units), 
to : 
M=y¥,,B (in Gaussian units), 


where -y’,, is determined -by the formula 


j NyP 2 Wo r ; $ s Drey j ; 
Xm =— 3T (in SI unite); Re 

oo nP e Ga sj it i a 
Xm = -aT (in Gaussian units). 


The quantity Xm is related to the magnetic susceptibility Xp, 
of paramagnetic materials by the formulas given in Sect. 26.3.4. 
The value of the quantity Xm is positive. for a paramagnetié 
material and ‘ranges from 10-5 to 10-8; therefore, La = Ge 
with a high degree of accuracy. $ 
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The Curié law states that the paramagnetic susceptibility of 
a substance is inversely proportional to the absolute tempera- 
ture. 

26.3.8 In very strong external magnetic fields, saturation magne- 
tization is reached; here aœ í and the Langevin function 
L (a) > 1. This means that the magnetic moments of all the 
atoms (or molecules) are aligned with the external magnetic 
field and M = noPn. 


26.4 Magnetic Field in Magnetic Materials 


26.4.1 Two types of currents that set up magnetic fields, ma- 
crocurrents and microcurrents, can be distinguished in matter. 
What we have called macrocurrents here are conduction currents 
(20.1.2) and convection currents (20.1.2). Microcurrents (mole- 
cular currents) are those due to the motion of electrons in atoms, 
molecules and ions. 

The magnetic field in a substance is the vector sum of two fields 
(15.2.2): the external magnetic field, set up by the macrocurrents, 
and the internal, or intrinsic, magnetic field, set up by micro- 
currents. The magnetic induction vector B (23.1.2) in the mag- 
netic field in a substance characterizes the resultant magnetic 
field and is equal to the vector sum of the magnetic inductions 
of the external Bọ and intrinsic Bintr magnetic fields. Thus 


B = Bo + Bintr- 


The primary source of magnetic fields in magnetic materials 
are the macrocurrents. Their magnetic fields are what leads 
to the magnetization of substances placed in an external mag- 
netic field. 

26.4.2 The total current law for a magnetic field in a substance 
is a generalization of the law formulated in Sect. 23.5.2: 


 Bdl=p (LmactImic) (in SI units), 
L 


1 
È B al == (Imac +Imte) (in Gaussian units), 
L 
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where Imac and mic are the algebraic sums of the macro- and 
microcurrents passing through a surface bounded by the loop L. 
26.4.3 The algebraic sum of the microcurrents is related to 
the magnetization vector M as follows: 


Ime = Mal (in SI units), 
L 


Laie — Mdl (in Gaussian units), 
L 


where $ Mdl is the circulation of the magnetization vector M 


(26.3.1) along the closed loop Z through which the microcurrents 
are threaded. 
The final form of the total current law (26.4.2) is 


(———) dl=Imac (in SI units), 
T, Lo 

4n : : f 
 B—40M) dl =a Imac (in Gaussian units). 
L 
26.4.4. The vector 


H = B m (in SI units), 


Ho 
H—B—4nM (in Gaussian units) 
is called the strength of the magnetic field existing in an arbi- 
(rary medium (cf. Sect. 23.2.3). The total current, law for a 
magnetic field in an argftrary medium is written in a form iden- 
tical to that in Sect. 23.5.4: 


L 


$ nal—— Imac (in Gaussian units). 
L 
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The circulation of the magnetic field strength vector along an 
arbitrary closed loop is equal (or proportional) to the algebraic 
“fee of the macrocurrents through the surface bounded by this 
oop. 

96.4.5 For an isotropic medium, the relation between the mag- 
netic induction vector B and the magnetization vector M 
(26.3.1) leads to the following result for vector H (26.4.4): 


H=(1— Xm) = (in SI units), 
0 
H=(1—4ny,,) B (in Gaussian units). 


Substituting for (1 — vm) or (1 —4 Tym) on the basis of Sect. 
26.3.4, we have l 


H| = B (in SI units), 
Mor 


H = E (in Gaussian units), 
r 


where 
u, = 1+ Xm (in SI units), 
ur = 1 + 4n%m (in Gaussian units). 


The quantity u, thus introduced is called the relative magnetie 
permeability of the substance (see also Sect. 23.2.1). In these 
formulas y, is the magnetic susceptibility (26.3.4). 


26.5 Ferromagnetic Materials 


26.5.1 Ferromagnetic materials are substances in which the 
intrinsic (internal) magnetic field (26.4.1) is hundreds or even 
thousands of times stronger than the external magnetic field 
applied to the substance. 

Ferromagnetism is observed in the crystals of the transition 
metals (39.3.8), iron, cobalt and nickel and of certain alloys, 
provided that d/a> 1.5, where d is the diameter of the atom 
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ao A the diameter of the unfilled electronic shell of the atom 
26.5.2 The principal properties of ferromagnetic materials, 
‘distinguishing them from other magnetic materials are: _ 

(a) The dependence of the magnetization (26.3.1) on the strength 
H of the external magnetic field is characterized by magnetic 
saturation Ms, which begins at H > Hg (Fig. 26.4). 

(b) The dependence of the magnetic induction B on H is charac- 
terized by the increase according to a linear law at H > Hs 
(Fig. 26.5). | | | 


Fig. 26.4 Fig. 26.5 


(c) The dependence of the relative magnetic permeability up 
on the field strength H is of a complex nature (Fig. 26.6). 

(d) Magnetic hysteresis exists in ferromagnetic materials. This 
refers to the lag in the variation of magnetization from the 
variation in the strength of the external magnetizing field, 
which is variable in magnitude and direction. This lag is due 
to ‘the dependence of M on its previous values (past history) 
in magnetizing the substance. | 

(e) The properties of ferromagnetic materials enumerated 
above* are observed at temperatures below the Curie point Qc 
(18.4.4). At temperatures 7 > 6c, thermal motion breaks up 
the regions of spontaneous magnetization (26.5.4) and the 


* As well as many others that are beyond the scope of this 
handbook. 
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ferromagnetic material, losing its special features, is converted 
into a paramagnetic substance (26.3.5). The Curie points for 
ae cobalt and Permalloy are 1043, 631, 1403 and 
823 K. 

26.5.3 The hysteresis loop is a curve showing the change in the 
magnetization of a ferromagnetic material located in an exter- 
nal magnetic field as the strength of this field is varied from -++ A, 
to —H, and back again, where Hg, is the field strength corre- 
sponding to magnetic saturation (Fig. 26.7). The magnetiza- 
tion +M, at H = +H, is called saturation magnetization. The 
magnetization +M, that remains in a ferromagnetic material 


0 H 
Fig. 26.6 Fig. 26.7 


in the absence of an external field (H = 0) is called the residual, 
or remanent, magnetization. This factor is the basis for making 
permanent magnets. The strength +H, of the external field 
that completely demagnetizes a substance is called the coercive 
force. The coercive force specifies the property of a ferromagnetic 
material to retain its residual magnetization. Hard, or retentive, 
magnetic materials have a high coercive force. They have a wide 
hysteresis loop and are used to make permanent magnets. Soft, 
or non-retentive, magnetic materials have a low coercive force. 
They have a narrow hysteresis loop and are used to make trans- 
former cores. 

The reversal of magnetization is associated with a change in 
the orientation of the regions of spontaneous magnetization 
(26.5.4) and requires that work be done at the expense of the 
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energy of the external magnetic field. The amount of heat evolved 
in reversing magnetization is proportional to the area of the 
hysteresis loop. 

26.5.4 At temperatures below the Curie point (26.5.2) a ferro- 
magnetic material is made up of domains, which are small re- 
gions of uniform spontaneous magnetization. The linear dimen- 
sions of these domains range from 10- to 10-4 m. Within each 
domain the substance is magnetized to saturation (26.5.2). 
In the absence of an external magnetic field, the magnetic mo- 
ments of the various domains are oriented in space so that the 
resultant magnetic moment of the whole ferromagnetic body 
equals zero. 

Owing to the effect of the external magnetic field, the magnetic 
moments of whole domains are oriented in a ferromagnetic 
material, rather than those of the separate particles as in the 
case of paramagnetic materials (26.3.6). As a result, the sub- 
stance is magnetized. 

l‘erromagnetic properties can only exist in the crystalline state 
of substances in which the interaction between the neighbouring 
atoms of the crystal lattice leads to the total energy of the sys- 
tem of electrons that provides for compliance with the condi- 
tions for the existence of ferromagnetism (26.5.1). 

26.5.5 Measurement of the gyromagnetic ratio (26.1.4) for 
ferromagnetic bodies indicates that the elementary carriers 
of magnetism in ferromagnetic materials are the spin magnetic 
moments of the electrons (26.1.4). The modern quantum mechan- 
ical theory of ferromagnetism explains the nature of the spon- 
taneous magnetization of ferromagnetic materials (26.5.4) and 
the reasons for the development of a strong intrinsic magnetic 
field (26.4.1). 

Ferromagnetic properties are possessed by crystals of substances 
whose atoms have inner shells and subshells (39.3.6) unfilled 
by electrons and having a nonzero value of the projections of 
the resultant spin moment on the direction of the magnetic 
field (26.1.3). Developed between the spins of such electrons 
is a specific kind of quantum mechanical interaction that is 
not of a magnetic origin. It is called exchange interaction 
(39.4.5). As a result, the state of the system of electrons in fer- 
romagnetic materials is found to be stable upon parallel align- 
ment of the spins, and spontaneous magnetization (26.5.4) 
is developed along with a strong intrinsic magnetic field. 
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CHAPTER 27 FUNDAMENTALS 
OF MAXWELL'S THEORY 


27.1 General Features of Maxwell's Theory 


27.1.1 Maxwell's theory is a systematic theory of a unified 
electromagnetic field (15.1.2) set up by an arbitrary system of 
charges and currents. Maxwell’s theory enables the principal 
problem of electrodynamics to be solved: determining the char- 
acteristics of electric and magnetic fields on the basis of a given 
distribution of charges and currents that set up these fields. 
Maxwell’s theory is a generalization of fundamental laws that 
describe electrical and electromagnetic phenomena. These 
include the Ostrogradsky-Gauss theorem (18.3.3), the total cur- 
rent law -(26.4.2), and the law of electromagnetic induction 
25.1.2). 

AE Maxwell’s theory is of a phenomenological nature. 
This is manifested in the fact that it does not deal with the 
internal mechanism of the phenomena that occur in a medium 
and lead to the production of electric and magnetic fields. The 
medium is described in this theory by three quantities that 
specify its electrical and magnetic properties. These are the 
relative permittivity (dielectric constant) er (18.3%), relative 
magnetic permeability p, (26.4.5) and the electrical conductiv- 
ity x (20.3.4). 

27.1.3.:Maxwell’s theory deals with macroscopic fields set 
up by macroscopic charges and currents that are concentrated 
in volumes incommensurably greater than those of. atoms (or 
ee It is assumed that the distances from the sources 
of the fields to the points in space being considered substan- 
tially exceed the linear dimensions of atoms (or molecules). 
Therefore, macroscopic fields change appreciably only at dis- 
tances that are huge compared to the linear dimensions of atoms 
(or molecules). Moreover, the periods of time required for changes 
in variable electric and magnetic fields are considered to be 
considerably longer than those required for intramolecular 
processes. . oe 
27.1.4 The macroscopic charges and currents are the sets of 
microscopic charges and currents that set up their microfiélds 
(electric and magnetic), varying continuously with time at 
each point in space. 
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The macroscopic fields dealt with in Maxwell's theory are 
averaged microfields. Averaging is carried out over time inter- 
vals considerably longer than those typical of interatomic 
processes, and over volumes of fields many times greater than 
the volumes of atoms and molecules (27.4.5). 

27.1.5 Maxwell's theory is one of short-range action, according 
to which electrical and magnetic interactions take place in 
electric and magnetic fields and propagate at a finite velocity 
equal to that of light in the given medium. This important 
consequence is taken into account in the electromagnetic theory 
of light developed by Maxwell. 


27.2 Maxwell's First Equation 


27.2.1 Maxwell's first equation in integral form is a generaliza- 
tion of Faraday’s law of electromagnetic induction in the form 
(25.1.2): 


eC c 

D E dl= — om (in SI units), 

L 

: ) 

QE dl = a (in Gaussian units). 
c at 

L 


According to Maxwell, this law is valid for any closed circuit, 
or loop, and not only current-conducting ones, conceivably 
selected in a variable magnetic field. This means that a variable 
magnetic field sets up a rotational electric field at any point in 
space, regardless of whether or not there is a conductor at that 
point. 
27.2.2 Using the following expression for the magnetic flux 
(23.5.5): 


5 S 


and the Stokes theorem from vector analysis: 


- 


$ Edl= $ curl E dS, 
S S 
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where dS = n dS (n being the unit vector of the normal to the 
element dS of surface), we can convert Maxwell’s first equation 
given in Sect. 27.2.1 to Maxwell’s first equation in differential 
orm. 


ðB 
curl E = = (in SI units), 
1 OB. ; 
curl E = S ee (in Gaussian units). 


Here curl E is expressed in Cartesian coordinates by the deter- 


minant 
i j k 
E ð ð O 
curi lor ôy Oz 
Ex Ey E, 


27.2.3 An inductive type of electron 
accelerator, the betatron (24.4.2), is 
based on setting up a rotational 
electric field in space by the action 
of an alternating magnetic field. 
In the alternating magnetic field pro- 
duced by an electromagnet with coni- 
cal pole pieces A and C (Fig. 27.1), 
a rotational electric field is set up in 
accelerating chamber D, having the 
form of a closed ring. The lines of 
Fig. 27.4 force (15.4.6) of the rotational elec- 
rope tric field lie in plane MN, which 
is perpendicular to the axis OO’ 
of symmetry of the pole pieces. These lines are circles with 
their centre at point K. At all points of any circle the 
field strength vector E has a constant numerical value and is 
tangent to the circle. Electrons in the accelerating chamber travel 
along circular paths and after repeated travel along the stable 
circular orbit the electron is accelerated to considerable energy. 
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27.2.4 The strength E of the rotational electric field in a beta- 
tron is equal to 
1 d(B) 


E=- =< 


2 dt ’ 


where (B) is the average value of magnetic induction at the 
instant of time t within the limits of the circular electron orbit 
of radius r. 

A condition for a stable orbit of the electron in the betatron is: 
B = (B)/2, where B is the instantaneous value of the magnetic 
induction on the orbit. 

The orbit of an electron in the betatron is stable if: 

(a) The whole orbit lies in a single plane. This condition, called 
axial focusing, is achieved due to the special shape of the pole 
pieces of the electromagnet. This shape provides for gradual 
weakening of the magnetic field induction in the direction 
from the centre of the orbit to its periphery. 
h Provision is made to return electrons that have acciden- 
tally left the stable orbit back to it again. This is called radial 
focusing and is achieved by a space distribution of the magnet- 
ic field in which the, magnetic induction decreases more slowly 
from the axis to the periphery than 1/r, where r is the distance 
from a given point in the field to the axis of symmetry OO’. 


27.3 Displacement Current. Maxwell's Second 
Equation 


27.3.1 Maxwell generalized the total current law, (26.4.2) and 
(26.4.4), by assuming that a variable electric field, in the same 
way as an electric current, is a source of a magnetic field. The 
quantitative measure of the magnetic effect of a variable electric 
field is the displacement current. 

27.3.2 The displacement current density (20.2.3) is 


p ôD . ; 

ias = 22 (in $I units), 
= 1414D 
Jdis = 4n ot 


23—0186 


(in Gaussian units). 
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The displacement current through the arbitrary surface S is 
a physical quantity numerically equal to the flux of the dis- 
Poa emon current density vector through this surface. 
Thus 


C aD 
Tais= | jasas= | 5? a8—S2 (in SI units), 
S S 
, 1 aD 1 dD, 
ldis= f Jdis dS = \ Zn ot eae (in Gaussian 
S S units), 


where v= | D dS is the flux of the electric displacement 


S 
vector through the surface S (15.3.2). 
Taking the displacement currents into account, any variable 
current has a closed circuit. Displacement currents “flow” through 
a part of a circuit where there is no conductor, for example, 
between the plates of a capacitor being charged or discharged. 
Shown in Fig. 27.2 are the vectors jqis and the lines of induction 
of the magnetic fields of displacement currents in charging 
(Fig. 27.2a) and in discharging (Fig. 27.2b) a capacitor. 
27.3.3 According to Sect. 18.3.4, the displacement vector in 
any dielectric is equal to 


D = &E -- Pẹe (in SI units), 


D=E+ 4nP, (in cgse units), 


where P, is the polarization vector (18.2.3). 
The displacement current density in a dielectric is 


, ðE oP ae : 
İdis = £o -57 + va (in SI units), 


OPe ; : 
-—— S 11ts). 
{ T (in cgse units) 
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In these last formulas, the first term £ E (or, correspondingly, 


1 oÆ 
4n ðt 
whereas the second term 6P,/0¢ is called the polarization current 
density. The second term is the density of the current due to 


) is called the displacement current density in vacuum, 


I T 
+ y + + | + 
MT yy” a P 
B 
- ôD 
“Jais FF 
(a) (b) 


Fig. 27.2 


the ordered motion of charges in the dielectric: either the dis- 
placement of charges in a molecule of a nonpolar dielectric. 
(18.1.3) or the turning of the dipoles in polar dielectrics (18.1.5). 
The displacement current does not evolve Joule heat (21.2.6) 
in vacuum or in metals. In this it differs from conduction cur- 
rents. 

27.3.4 Maxwell added the displacement current to the right- 
hand side of the total current law in the form given in Sect. 
26.4.4, writing this law in the form 


\ H dl=JImac+Jais (in SI units), 
L 


23E 
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a (Imac+{Iqis) (in Gaussian units). 


This equation is called Maxwells second equation in integral 
form. It shows that the circulation of the magnetic field strength 
vector along an arbitrary closed circuit, or loop, L is equal to 
the algebraic sum of the macrocurrents and the displacement 
current through a surface bounded by this:loop. 

27.3.5 Making use of the Stokes theorem from vector analysis 


b Ha=> curl H dS, {where dS=nd5S, and n is a unit 
L S 


vector of the normal to the element dS of surface, and of the 
expression for the total current 


a EE A | (jats) dS, 


Mazwell’s second equation can be written in differential form: 
., oD : 5 
curl aX (in SI units), 


1 oD 


curl neo L j4+— ae 


(in Gaussian units), 


In these satan curl H has the same meaning as curl E in 
Sect. 27.2.2. 

27.3.6 In the absence of conduction currents (j = 0), Maxwell’s 
first and second equations are of symmetrical form except for 
sign in their right-hand sides: 


OB aD . l 
curl E= — Eri ; curl lee un SI units), 
curl E= — a curl Hane (in Gaussian units). 
c ôt c ót 


Comparing Maxwell's equations*, we reach the following con- 
clusions: 


* The numeration of Maxwell’s equations is a conditional one 
and is frequently the opposite of that accepted in this handbook. 
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(a) Electric and magnetic fields are interrelated: a change of 
an electric field with time sets up a magnetic field*. In its 
turn, a variable magnetic field is the source of a rotational elec- 
tric field. 

(b) The different signs at the right-hand sides of Maxwell's 
equations comply with the law of conservation of energy and 
with the Lenz law (25.1.4). If 

the signs of 0B/dt and @D/at aD dB 
were the same, an infinitesimal Ot at 

increase in one of the fields would i 


lead to an increase without li- n n 

mit of both fields, whereas an ,-~77~ ~~, TITON 
infinitesimal decrease in one of | 

the fields would lead to the dis- E 
appearance of both fields. The H 
above-mentioned difference in 
the signs of the right-hand (a) ( b) 


sides of Maxwell’s equations is a 

necessary condition for the exi- Fig. 27.3 

stence of a '’stable electromag- grar 

netic field . 

The different signs in the right-hand sides of Maxwell’s equa- 
tions correspond to the fact that the directions of 0D/dt and H 
form a “right-hand screw” system (Fig. 27.3a), whereas the 
directions of dB/dt and E form a “left-hand screw” system 
(Fig. 27.3b), ` | 


27.4 Complete Set of Maxwell's Equations for an 
Electromagnetic Field 


27.4.1 A complete set of Maxwell’s equations for an electro- 
magnetic field includes, in addition to the equations dealt with 
in Sects. 27.2.1, 27.2.2, 27.3.4 and 27.3.5, the Ostrogradsky- 
Gauss theorem for an electric field (18.3.3): 


$ D dS = qfreẹ (in SI units), 
s 


* The magnetic field is always a rotational ono (23.5.3). 
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a 
D D dS = 4nqfree (in cgse units) 


S 

and the same theorem for a magnetic field (23.5.6): 
$ Bas=0. 

sS 


Maxwell proposed that the theorem for the flux of an electric 
field displacement vector is valid, not only for a steady-state 
electrostatic field, but for a variable electric field as well. 

27.4.2 Making use of the Gauss theorem from vector analysis: 


$a dS = | div A dV, and introducing the volume density p 
S 4 


of free charges: 0 = dqfree/dV (where dV is an element of vol- 
ume), it is possible to obtain Maxwell's third equation in diferen- 
tial form: 


div D = p (in SI units), 
div D = 4np (in Gaussian units). 


In these equations, div A (where A is an arbitrary vector) is 
determined in Cartesian coordinates as follows: 


; 0A, OAy 0A, 
Wwe sper oe on 


£ 
where A = A i+ A,j + A-K, and i, j, and k are unit vectors 
along the coordinate axes. 
27.4.3 The complete set of Maxwells equations includes four 
equations: 


ôB i 
curl E= — -JF '’ div D=o 
(in SI units), 
“3 ~. ôD oe 7 
curl H=j+ -zr ' div; B=0 
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1 OB 


curl E= E div D = 4np 
(in Gaussian units). 
An ,, 1 OD : 
curl H =- j Poy ae ; div B—0. 


27.4.4 The system of Maxwell’s equations is supplemented ‘by 
equations that specify the electric and magnetic properties 
of the medium. For macrocurrents in an isotropic medium, which 
comply with Ohm’s law (20.3.4), these equations are of the 
‘orm 


D = e9&,E, B = pot,-H, imac = XE (in SI units), 
D = &E, B=u,H, ima, = XE (in Gaussian units). 


Here & and u, are the electric constant (14.2.7) and the magnet- 
ic constant (23.2.2) in SI units, e, and u, are the relative dielec- 
tric constant and relative magnetic permeability, and x is the 
electrical conductivity. 

To solve a system of Maxwell’s equations, it is also necessary 
to specify the boundary conditions for the vectors character- 
izing the electromagnetic field. Thus 


Pay Png =% 2 
(in SI units), 
By, = Bag’ H — Hy, = jsur 
Dajm Da mA, Et = Etg 
(in. Gaussian units), 
A 
By, = Fry 5 c a a Jsur 


where o is the surface density of free charges, n is a unit vector 
of the normal to the interface (boundary) between media and 
extends from medium 2 to medium 1, t is a unit vector of the 
tangent to the interface, and j sy, is the projection of the density 
vector of surface conduction currents on the direction [tn]. 
For given boundary and initial conditions, i.e. known values 
of.-yectors._E _and Hat. the initial instant. of time (¢.==.0),: the 
system of Maxwell’s equations has a unique solution. — 
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27.4.5 Maxwell’s equations are invariant with respect to the 
Lorentz transformations (5.3.2). 

In the special theory of relativity (5.1.1) it is shown that a 
common electromagnetic field can be different in different iner- 
tial frames of reference (2.1.2). In particular, one of the fields, 
either electric or magnetic, may be absent in one of the coordi- 
nate systems and be present in the other. The following formu- 
las are used for the Lorentz transformations of the projections 
on the coordinate axes of the vectors E, H, D and B of the elec- 
tric and magnetic fields, in going over from fixed inertial frame 
of reference K to frame K’, which travels with respect to frame 
K at the constant velocity V in a straight line along the OX 
axis: 


BoE, Ea oh p E 
V 1—V?/c2 V 1—V2/c? (in SI 
r , t , units), 
yp go wet VD, ) 
PE U Geass 
a V ’ ’ y , 
eee Ot Gee 
ee Vi-vae 7 ViVe 
r V a , V , 
B.=B'. B Sy a*r B + -m Ey 
ii ” VIve? i AETA m r ` (in Gaus- 
sian 
ELI H' pI H’ units). 
Paro Gee, ee 
se V VIV © VIV ' 
Hy~ > E; H+- E, 
H;=Hi, Hy = 


ee, ae 
V 1—V?/c? Z ViVe 


27.4.6 Tbe classical Lorentz theory of the electron is a further 
‘development of Maxwell’s theory of the electromagnetic field. 
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The Lorentz theory is based on a definite model that implies 
certain conceptions of the structure of matter. It is assumed that 
atoms consist of negatively and positively charged particles, 
and that the great variety of electrical and magnetic phenomena 
can be explained by a definite spatial arrangement, motion and 
interaction of charges and microcurrents. At each point in 
space certain electric and magnetic microfields exist with the 
strengths e and h. These fields result from the combined action 
of all the charges and microcurrents. The microfields comply 
with a system of equations similar to Maxwell’s equations 
(27.4.3). Maxwell’s equations for macroscopic fields E and B 
(27.1.3) can be obtained by averaging the equations of the 
electron theory (27.1.4), and they are found to be the averaged 
microfields e and h. Thus 


E = (e), B= po (h) (in SI units). 


Vectors D and H are found to be related to (e) and (h) by the 
polarization vector P, (18.2.3) and the magnetization vector M 
(26.3.1) as indicated in Sects. 18.3.4 and 26.4.4. Thus 


D=e,E+ Pe, H=- —M (in SI units), 
* Ho 


D=E-+47Pe, H=B—4nM (in Gaussian units). 


PART FOUR OSCILLATIONS 
AND WAVES 


CHAPTER 28 FREE HARMONIC OSCILLATIONS 


28.1 Harmonic Oscillations 


28.1.1 Oscillations, including vibrations and variations, are 
rocesses (motions or changes of state) that are repeated more or 
ess regularly with time. According to the physical nature of 
the oscillatory process and the “mechanism” by which it is 
initiated, distinction is made between mechanical oscillations, 
commonly called vibrations (swinging of pendulums; vibration 
of strings; reciprocation and oscillation of machine components; 
vibration of buildings, bridges and other structures; compres- 
sion and rarefaction of air as sound is propagated through it; 
rolling, pitching and heaving of ships; sea waves; etc.), electro- 
magnetic oscillations (oscillations of alternating electric current 
in a circuit, variations in the field strength and magnetic induc- 
tion vectors, E and B, of an alternating electromagnetic field, 
etc.), and electromechanical vibrations (vibration of the dia- 
phragm of a telephone receiver, of the diaphragm in an elec- 
tromagnetic loudspeaker, etc.) and others. 

The system executing the oscillation (or vibration) is called 
the oscillatory (vibratory) system. Free (natural) oscillations are 
ones that occur in the absence of externally applied variable 
action on the system. They are due to some initial deviation 
of the system from a state of stable equilibrium. Forced oscil- 
lations (vibrations) are ones set up in a system as a result of 
variable external effects (for instance, current oscillation in an 
electric circuit caused by a varying emf, swinging of a pendulum 
due to varying external forces, etc.). 

28.1.2 Oscillations (vibrations) are said to be periodic if all the 
physical quantities that characterize the oscillatory (vibra- 
tory) system and vary. during its oscillation are repeated in 
successive time intervals. The minimum time interval 7 com- 
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plying with this condition is called the period of oscillation 
(or vibration). This is the time required for one complete oscil- 
lation (vibration) of the system. The frequency of periodic os- 
cillations (vibrations) is the quantity v = 1/T, equal to the 
number of complete oscillations (vibrations) executed by the 
system in unit time. The angular, or circular, frequency is the 
quantity © = 2nv = 2n/T, which is equal to the number of 
complete oscillations accomplished in 2x units of time. 
28.1.3 In periodic oscillations the dependence of a varying 
quantity s on the time ¢ complies with the condition s (t + T) = 
= s (ft). 

The periodic oscillation of the quantity s (t) is said to be har- 
monic if 


s (t) = A sin (Ot + Po) or s(t) = A cos (0t -+ Qi), 


where © = 2nv = (2n/T) = const is the angular, or circular, 
frequency of harmonic vibration, A = Smax = const > 0 is the 
maximum value of the varying quantity s and is called the 
amplitude of oscillation, and o and g, = ẸPọ — 7/2 are con- 
stant quantities. The value of s at some arbitrary instant of 
time ¢ is determined by the phase of the oscillation ® (t) = 
= wt + Po [and, respectively, V, (t) = ®t + Qı]. The quanti- 
ties Ẹọ and q, are the initial phases of the oscillation, i.e. the 
values of @(¢) and Q, (¢) at the instant (t = 0) of time refer- 
ence: Po = ® (0) and g, = ®, (0). 

28.1.4 The first and second time derivatives of a harmonically 
varying quantity s (t) also vary harmonically at the same angu- 
lar frequency: 


<= Ao cos (ot + Po) = AO sin (otot) ' 
2 
Lan = — Aw? sin (@t + Po) = Aw? sin (Wt + Po +1), 


where the amplitudes of ds/di and d?s/dt? are equal, respectively, 
to Aw and Aw*. The initial phase of ds/dt is equal to (Qo + 1/2), 
i.e. the phase difference of the variations of ds/dt and s is con- 
stant and equal to 7/2 (the quantity ds/dt has a phase lead of 
m/2 with respect to s). The initial phase-of d?s/dt?.is: equal. te 
(Po + x), i.e. the phase difference of the variations of ‘d®s/dt? 
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and s is constant and equal to a (the quantity d?s/dt? has a 
phase lead of x with respect to s). Curves of the dependence on 
the time ż of the quantities s, ds/di and d?s/dt? for harmonic varia- 
tions and with ọọ = 0 are shown in Fig. 28.1. 


ds d@’s 
oat’ al? y= 0 


Fig. 28.4 


28.1.5 It is evident from the second equation in Sect. 28.1.4 
that a harmonically varying quantity s satisfies the differential 
equation 


d*s 

qe Os 0. 

The general solution of this equation is of the form 
s = A, sin Ot + A, cos Ot, 


where A; and Ag are arbitrary constants of integration. The 
values of A; and A, can be found from the initial conditions, 
i.e. when the values of s and ds/dt are known for the initial in- 
stant of time {t = 0). Thus 

1 ds 
A= sa (Fh and A, =s (0). 
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The general solution can be transformed to the standard form 
for harmonic oscillations (vibrations) given in Sect. 28.1.3: 


s=A sin (Oz F Po), 


where A= V A?+ 42 and @)=arc tan (4/41). 
Thus, the quantity s has harmonic variations if and only if 
it satisfies the differential equation given at the beginning of 
this subsection. This equation 
is called the differential equation 
of harmonic variations (or vibra- 
tions or oscillations). 
28.1.6 Harmonic oscillations (vi- 
brations) can be represented 
graphically by means of a vec- 
tor in a plane. For this purpose 
vector A is drawn from the ori- 
gin of coordinates O in the plane 
XY (Fig. 28.2). The magnitude 
of vector A is equal to the am- 
peut A of the oscillation 
eing considered andthe vector Fig. 28.2 
makes the angle ọ = œt + Por 
equal to the phase of the oscilla- : , 
tion at the given instant of time ż, with the coordinate axis OX. 
In the course of time angle ọ increases so that vector A rotates 
at a constant angular velocity about point O equal to the angular 
frequency of oscillation (vibration) ©. The e poe E pro- 
jection of vector A on the vertical axis OY varies harmonically 
according to the relation 


A, = s = A sin (ot + Ẹo). 


The graphical representation of harmonic vibrations (or oscil- 
lations or variations) by means of a rotating amplitude vector 
is ealled the vector diagram method. It is extensively applied, 
for instance, in adding together harmonic oscillations having 
the same direction (28.4.2). 

28.1.7 According to the Euler formula for complex numbers, 


eif = cos ọ + ising, 


where i = V —1. Hence the harmonic oscillation (vibration) 
s = A sin (wt +- Q) = A cos (Ot + pı), where pı = Po — 7/2, 
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can be written in the exponential form 


fate i(wt - 
s =: Ae’ — Aet 4 01), 


where A = Aeip1 is the complex amplitude. Only the real part 


of the complex function s, denoted by Re {s}, has a physical 
meaning: 


Re {s$} = s = A cos (@t + Qı) = A sin (0t + po), 
where @p = 9, + 7/2. 


28.2 Mechanical Harmonic Vibrations 


28.2.4 If a particle vibrates harmonically in a straight line 
along the coordinate axis OX about the equilibrium position, 
taken as the origin of coordinates, the dependence of the coor- 
dinate z of the particle on the time ż is of the form given in 
Sect. 28.1.3 (with s = x). Thus 


x = A sin (®t + @Q). 


The projections of the velocity v and acceleration a of the par- 
ticle on the OX axis are: | 


Uy = Vo COS (Ot 4- Po) and a, = — ap sin (Mt A- Qo), 


where vy = Aw is the amplitude of the velocity, and aj = 
== AW? = v% is the amplitude of the acceleration. 
The force F acting on the particle equals 


F = ma and F = — mw*z, 


where m is the mass of the particle. Consequently, force F is 
proportional to the displacement of the particle from the equi- 
librium position and opposite in direction: 


F = — mo2zi, 


where i is a unit vector of the OX axis. 

Such a dependence of the force on the displacement is typical 
for an elastic force (3.3.6). Hence, forces of other physical ori- 
gin, satisfying the same kind of relationship, are said to be 
quasi-elastic forces. 
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28.2.2 The kinetic energy of a particle vibrating harmonically 
in a straight line is equal to 


242 
cos? (ot + Po) = aoe cos? (œt -+ Po), 


mv? mv 


Ek=-3 2 


or 


m 


w? A? 
Ex ae co [1+ cos (2@¢+ 2@,)]. 


The kinetic energy of the particle varies periodically from 0 
to mw?A2/2 as it vibrates harmonically with the angular fre- 
quency 2m and the amplitude mw?A2/4 about its average value 
of mw?A?2/4. 

The potential energy of a particle vibrating harmonically in a 
s raight line due to the effect of a quasi-elastic force is equal to 


x 
2 2 2 A2 

Ep = — \ Fy dz = no =m Sin? (wi-+ Qo), 
0 

or 3 
2 A2 

Ep= mo” [eos (20t + 29,)] 

ma? A2 


=T [1+ tos (20t +20 +1)]. 


The potential energy of the particle varies periodically from 
0 to mœ?A?/2 as it vibrates harmonically with the angular fre- 
quency 2 and the amplitude mw?A2/4 about its average value 
of mm?A?/4. The variations of the potential and kinetic energies 
have a phase difference of n, so that the total mechanical energy 
of the particle remains constant in vibration: 


2 42 
E=E,+ Ep = mo a’ const. 


Curves of the dependence of Ey, Ep and E on the time ż for the 
case when ọọ = 0 are shown in Fig. 28.3. 


368 a FREE HARMONIC OSCILLATIONS [28 


28.2.3 Example 1. A linear harmonic oscillator is a particle 
of mass m having harmonic vibration along a straight line by 
the action of elastic force Fe] = — kzi (3.3.6). Serving as an 
example of such a system is the spring pendulum, where a weight 
of mass m is suspended by an absolutely elastic spring (k is a 


fr, Ep, E 


Ie t 


Fig. 28.3 


factor specifying the elastic properties of the spring). The mo- 
tion equation is 


k 
m ae on —Èzr or e E z=0. 


It follows from Sect. 28.1.5 that an oscillator (spring pendulum) 
has harmonic vibration according to the equation z = 
=A sin(wt-+ Po) with an angular frequency œ and period T 
equal to 


Ai i and -[—=2n Vy =. 
m , k 


The potential energy of a linear harmonic oscillator is 
kx? 

Ep= 5 

28.2.4 Example 2. A compound, or physical, pendulum is a 

rigid body capable of oscillating under the action of the grav- 

ity force mg about a horizontal axis O, which does not pass 

through the centre of gravity of the body, and is called the azis 


28.2] MECHANICAL HARMONIC VIBRATIONS 369 


vf oscillation, or swing (Fig. 28.4). The centre of gravity of the 
pendulum coincides with its centre of mass C (7.3.2). Point 
O, the intersection of the axis of 
oscillation with a vertical plane 
passing through the centre of 
vravity of the pendulum and per- 
pendicular, to its axis of oscilla- 
lion, is called its point of suspen- 
sion. 

In the absence of friction forces 
in the suspension, the motion equa- 
lion of the pendulum is of the 
lorm (4.3.4) 


d?a TAR 
eg — mgd SIn a, 


where œ is the angle of devia- 
tion of the pendulum about its 
axis of oscillation from the equi- 
librium position, d= OC is the = 

distance re the centre of mass Fig. 28.4 

of the pendulum to its axis of 

oscillation, J is the moment of inertia (4.2.1) about the same 
axis, m is the mass of the pendulum, and g is the free fall 
acceleration. Upon small oscillations of the pendulum sina x & 
and the motion equation of the pendulum is of the form 


Pa | med c=0 / 

AE EOE A 

i.e. angle a satisfies the differential equation of harmonic vi- 
brations (28.1.5). Hence, in the absence of friction, small 
oscillations of a compound pendulum are harmonic: 


x == Ag sin (wt -+ Po), 

where & is the amplitude of variation of angle a, and 
ai mgd TE. I 

0 = p and T =2n aa 


are the angular frequency and period of small oscillations of 
a compound pendulum. 


24--0186 
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28.2.5 Example 3. A simple pendulum is a particle suspended 
by a weightless, inextensible thread and oscillating in a vertical 
plane by the action of the gravity force. The simple pendulum 
is the limiting case of the compound (physical) pendulum, whose 
whole mass is concentrated at the centre of mass, so that d = l, 
the length of the simple pendulum. The moment of inertia of 
a simple pendulum with respect to its axis of oscillation is 
I = ml?. Correspondingly, the angular frequency and period 
of small oscillations of this pendulum are 


omy = and r—ony L. 
l rg 


Small oscillations of compound and simple pendulums are 
examples of isochronous vibrations, i.e. vibrations whose fre- 
quency and period are independent of their amplitude. 

In the general case, the period of a compound pendulum depends 
upon its amplitude a). Thus 


T =27 V4 [1+ (5) sine 2 


2 
4 3)\2 a 
wiles — © —— z] 4 0 
+ (5 z) sin 5 pics, 


The variation in the value of T when @, is increased to 15° does 
not exceed 0.5 per cent. 
28.2.6 The equivalent lengih ly of a compound pendulum is 


the length of a simple pendulum that has the same period of 
oscillation. Thus 


I I 
eo Oe 


where Iç is the moment of inertia of the compound pendulum 
with respect to an axis passing through the centre of mass C 
of the pendulum and parallel to its axis of oscillation. Point 
Q,, lying on the straight line OC at the distance of leg from the 
point of suspension O (Fig. 28.4), is called the cenire of oscilla- 
tion of the compound pendulum. The centre of oscillation O, 
and the point of suspension O are interchangeable: if the pen- 
dulum is suspended so that its axis of oscillation passes through 
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point O,, then point O will coincide with the new position of 
the centre of oscillation, i.c. the equivalent length and period 
of oscillation of the pendulum remain unchanged. 

28.2.7 Example 4. Small free vibrations of electrons in a plasma 
(22.6.1) are called Langmuir oscillations of the plasma. They 
are caused by the forces of an elec- 
tric field, which is set up in an elec- 
trically neutral plasma as the result 
of some random deviation of the 
spatial distribution of the electrons 
from their equilibrium distribution. 
Assume that in a flat layer of plasma 
of thickness l (Fig. 28.5) the elec- 
trons are displaced over the small 
distance s along the positive direc- 
tion of axis OX. This leads to the 
formation of a surplus positive charge Fis. 28.5 

at the left-hand part of the layer and va eee 

a surplus negative charge at the 

right-hand part. Consequently, an electric field is set up, whose 
strength E is along the OX axis and its projection on this axis 
is Ey = noes/@g, where ng is the electron concentration in the 
plasma, e is the magnitude of the charge of the electron, and 
& is the electric constant. According to Newton’s second law, 
the motion equation for the electrons of the plasma in this elec- 
tric field is of the form: 


ds e* No d?s eno 
z= eh, = ———s or al 
dt Eq dt ME, 


\ 


m s=0, 


where m is the mass of the electron. Thus, the electrons of the 
plasma have free harmonic vibrations at the angular frequency 


n 
O= e = 
MEg 


which is called the Langmuir plasma frequency, or simply, the 
plasma frequency. 


OAR 
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28.3 Free Harmonic Oscillations in an 
Oscillatory Electric Circuit 


28.3.1 An example of an electric circuit in which free electric 
oscillations may occur is the simple oscillatory circuit (Fig. 28.6). 
It consists of a capacitor of capacitance C connected in series to 
an induction coil with the self-induc- 

I>0 tance L. When the previously charged 
capacitor is connected to the coil, 
free oscillations of the capacitor charge 
and the current in the coil are pro- 


È 7 L duced. This sets up a variable elec- 
g tromagnetic field that propagates 
through space at a velocity. equal to 

that of light. Hence, if the linear di- 

Fig. 28.6 mensions 1 of the circuit are not 


too large (L& c/v, where c = 3 x 

x108 m/sis the velocity of light in 
vacuum, and v is the frequency of oscillation in the circuit), 
it can be assumed that at each instant of time ż the current I 
is the same at all parts of the circuit. Such a variable current is 
said to be quasi-stationary. According to Ohm’s law (24.2.3) 
for the portion 1-L-2 of the circuit (lig. 28.6) 


a q dI 
[R=V,—V.+ és or [R= —-p— L -ir > 
Here q and V, — V = — q/C are the charge of the capacitor 


and the potential difference of its plates at the arbitrary instant 
t of time being considered, R is the electric resistance of the 
oscillatory circuit, i.c. of the part 1-L-2 of the circuit, and 
6; = —L (dI/dt) is emf of self-induction in the coil (25.2.4). 
If follows from the law of conservation of electric charge (14.1.3) 
that the quasi-stationary current in the circuit is J = dgq/dt. 
Therefore, the differential equation for the variation of charge 
q is of the form 
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28.3.2 Free electric oscillations in an oscillatory circuit are 
harmonic if its electrical resistance R = 0. Thus 
d?q 1 


Fea ae 


The angular frequency œ and the period 7’ of these oscillations 
satisfy the Thomson formula: 


0a — 


A and T=2nVLC , 
V LC 
‘The charge q of the capacitor and current J in the circuit vary 
according to the equations 


=o Sin (@t-+ P) and 


| =I) cos (wt + Go) = Io Sin (ot-+m+—-} , 


where g) is the amplitude of the capacitor charge, Io = @q) = 
= q/ V LC isthe amplitude of the current, and gp is the ini- 
tial phase of capacitor charge oscillation. The current in the 
circuit has a phase lead of n/2 over the capacitor charge. 

‘The potential difference u = V — V, of the capacitor plates 
also varies according to a harmonic law and is in phase with 
charge q. Thus 


w= Uy sin (wt -+ Go), 


where Uy == q,/C is the amplitude of the potential difference. 
The amplitude of the current is 


T 
l= a V +. 


The quantity V L/C is called the characteristic, or wave, impe- 
dance of the oscillatory circuit. 

28.3.3 With free harmonic oscillations in an oscillatory circuit, 
there is a periodic transfer of the energy Z, of the electric field 
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of the capacitor into energy £,, of the magnetic field of the in- 
ductance coil and back again. Thus 


ee ee g sin? (wt + jai [1— cos (20t + 2Po)] 
SOC 2C mes AE a 
2 2 2 
i i = = cos? (wt + o) = a [1-+ cos (2wt-++ 2q,)]. 


For this reason, the oscillations in an electrical oscillatory cir- 
cuit are frequently said to be electromagnetic oscillations in the 
circull. 

The values of /, and Em vary in electromagnetic oscillations 
within the limits from 0 to their maximum values, equal, re- 
spectively, to q3/2C and LJ3/2, in which q3/2C = LI?/2. The 
variations of Fẹ and Em are displaced in phase. At the instants 
of time when FE, = 0, Em = Em may = LI?/2. On the other 
hand, when Em = 0, Ee = Ee max = 93/2C. The total energy 
of electromagnetic oscillations is constant with time. Thus 


| 2 LI? 
E =- Eet Em = SL at = const. 


28.4 Adding Harmonic Oscillations 


28.4.1 The addilion of oscillations (vibrations or variations) 
consists in finding the equation for the resulting oscillation of 
a system when this system simultaneously participates in several 
oscillatory processes. Distinction is made between two limiting 
cases: the combining of oscillations having the same direction 
and the combining of mutually perpendicular oscillations. The 
first case corresponds, for example, to the vibration of weight 7 
(Fig. 28.7) which vibrates with respect to weight 2 on spring a, 
and together with weight 2 on spring b. This same situation 
occurs in superposing the variations of scalar physical character- 
istics of an oscillatory or variable system (pressure, temperature, 
density, electric charge, current, etc.). 

28.4.2 Two harmonic oscillations having the same direction, such 
as sı = A, sin (@,t -|- g;) and sy = Ag sin (Mgt -+ Q2) can be 
added together by using the vector diagram method (28.1.6). 
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Shown in Fig. 28.8 are the vectors A; (t) and A, (t) of the first 
and second oscillations at the arbitrary instant ¢ of time when 
the phases of these oscillations equal: @, (t) = œt + q, and 
(Dy (t) = Wet + P2. The resultant oscillation s = sı -+ s, cor- 


e 


O o a to! 
D(t) blt) &,(t) 
Fig. 28.7 Fig. 28.8 
responds to the vector A (t) = A, (t) + A, (t) whose projec- 
tion on the vertical axis OY is equal to s. Thus 
s = A (t) sin © (2). 
According to the cosine law 
[A (8)? = A? + A? + 24,4, cos [P, (t) — P, (2), 
and 
A, sin @, (¢)-+ A, sin @, (t) 
P _ “4 1 2 2 
pa) A, cos P, (¢)-+ A, cos PD, (t) ° 


28.4.3 Two harmonic oscillations sı and s, are said to be coherent 
if their phase difference is independent of time. Thus 


L lD, ()—®, ()] = 0 and D (f)—®, () = const. 
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Since Pa (t) — D, (1) = (© — 01) t+ (P — Gy), the angular 
frequency of coherent oscillations shou d be the same, i.e. 
@, = ©, = ©. At any instant of time the phase difference of 
coherent oscillations is equal to the difference of their initial 
phases: MD, (t) — Pı (t) = Pa — Pı. Consequently, the resultant 
oscillation is harmonic and has the same angular frequency ©, 


j.e. 
s = sı + s2 = A sin (Ot + Qo), 


where 

A? = A} + AZ + 2414o cos (Po — Fj) 
and 

tana A, sin @, +A, Sin P2 


A, COS Pi + Az COS Py ’ 


In accordance with the values of the initial phase differences 
of the oscillations being added together, the amplitude A of the 
resultant oscillation varies within the limits 


from A = | Ay — Ag| at OD —GQ = t2m+ i)n 
to A = Ay -+ As at Do == Pı = +2mī 


where m = 0, 1, 2, ... is any integral nonnegative number. 
When Ọs — Q; = +2mn, the oscillations being added are said 
to be en (or simply, in phase), whereas at @. — Q; = 
= + (2m +4- 1) x they are said to be antiphased (or in opposition). 
28.4.4 Harmonic oscillations of different frequency (©, =4 ©,) 
are incoherent because their phase difference, equal to 
(O — @,) t+ (P2 — Pı), varies continuously with time. If 
such oscillations are superposed, an anharmonic resultant 
oscillation is obtained. The amplitude vectors A; and A, of the 
added oscillations (Fig. 28.8) rotate at different velocities, so 
that the parallelogram constructed on the vectors is continuously 
deformed and its diagonal, vector A of the resultant oscillation, 
varies in length and rotates at variable angular velocity. 

Two harmonic oscillations having different angular frequencies 
œ% and © can be approximately considered to be coherent only 
during the time interval Aż in which the phase difference of these 
oscillations varies only slightly: | ©, — o, | At & 2n or Ai< 
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< Teoh, Where Teon = 2n/| © — @,| is the coherence time 
of the oscillations being considered. 

28.4.5 Anharmonic oscillation obtained as a result of super- 
posing two harmonic oscillations having the same direction and 
slightly differing frequencies (| © — ©, | < ®,) produces what 
are called beats. It proves convenient to take as the time reference 
L the instant when the phases of the two oscillations being super- 
posed, sı and s» coincide and are equal to pọ. Then s = 
= A, sin (@,¢ + Po) and So = Ag sin (Dyt 4- Po) = 
= Áa sin [@,¢-+ Po + @ (t)], where @ (1) = (©, — œ). The 
resultant oscillation satisfies the relation 


s = A (t) sin [®t + Po + (I, 


where 

[A ()]? = 42 + AZ + 2414, cos @ (t) 
aud 

tan w(t) = aoe) 


A, +A, cos ọ (t)? 
In particular, when A; = A, = Ao, 


l(t) = 2A, cos eh t and w*(t)= 
so that 


Ws — . 0): — W 
s =2A, COS a t sin (ae I+ @o | . 


The quantity | A (¢) |, characterizing the swing of the oscilla- 
lions in beating, varies within the limits from | 4; — A. | to 
A, -+ A, with the angular frequency Q = | 0, — a, |, called 
the angular beat frequency. Since the beat frequency is much 
lower than the frequency of oscillation (Q < œ), the variable 
quantity | A (4) | is conditionally called the beat amplitude. 
The beat period Tp and the beat frequency vp are equal to 


27 27 4 


e Taol E 


E ee 


Tp eee 
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and 


Vh = =: | vVo— vı l, 


1 
Th 
where 73, vı, Ta and vg are the periods and frequencies of the 


superposed oscillations. Curves of the dependence of s on ¢ in 
beating are shown in Fig. 28.9 for the case when A, = Az = Ao. 


S 


al vel thal 


sion CN ult 


Fig. 28.9 


28.4.6 As a result of the superposition of harmonic oscillations, 
coinciding in direction and having angular frequencies that are 
integral multiples of their lowest frequency, i.e. ©, 20, 30, etc., 
a periodic nonharmonic oscillation with the period T = 2n/o 
is obtained. On the other hand, any complex periodic oscillation 

-- f(t) can be represented as the sum of simple harmonic 
bseillanione having angular frequencies, which are teers 
multiples of the basic angular frequency © = 2n/T, where T is 
the period of the oscillation. Thus 


$2) (i) == aa + 5 (an CoS nWE-+ bp sin not) 


n==1 


2 Pn) , 
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where 
a ote 
ty = = | f (t) cos not dt (n=0, 1,2,...), 
-T72 
T/2 
Un = f (t) sin not dt” (n=1, 2,...). 
=T/3 


Such a representation of the periodic function f (t) is called the 
expansion of this function into a Fourier series, or the harmonic 
analysis of a complex periodic oscillation. The terms of the Fourier 
series corresponding to the harmonic oscillations with the angu- 
lar frequencies ©, 2, 30, etc. are called the first, or fundamental, 
second, third, etc. harmonics of the complex periodic oscillation 
s = f (t). The whole set of these harmonics forms the spectrum 
of the oscillations s = f (t). The composition of the spectrum 
depends on the form of the periodic function s = f (t). In the 
simplest cases, the spectrum can consist of a small number of 
harmonics. 

The oscillation spectrum is often understood to be the frequency 
spectrum, i.e. the set of frequencies of simple harmonic oscil- 
lations that can be superimposed to obtain the complex oscil- 
Jation being considered. Periodic oscillations have discrete 
(line) frequency spectra. 

28.4.7 Nonperiddic oscillations have, as a rule, continuous fre- 
quency spectra, i.e. they can be represented as the result of the 
superposition of a great many harmonic oscillations, whose 
frequencies have all possible values in a certain range (from 0 to 
co in the general case). The harmonic analysis of such oscil- 
lations consists in presenting them in the form of a Fourier 
inlegral: 


sf (1) = \ [a (w) cos wt-+ b (œ) sin wt] do, 
0 


where 


8 


w(o)= -i | Ocata, o(o=— | 7 @ sin otag, 


-8 f SA 
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Certain nonperiodic oscillations, called almost periodic (quasi- 
periodic) oscillations have a line frequency spectrum. But the 
included angular frequencies are incommensurable with one 
another, because their ratios are expressed by irrational numbers. 
28.4.8 Modulation is the change of any parameter of a periodic 
oscillation (for instance, the amplitude or frequency) according 
to a definite law during a length of time considerably longer 
than the period of the oscillation. 

In amplitude modulation, for example, of the harmonic oscillation 
s = Ag sin (Wot + Po), the modulated oscillation is of the form 


s = Ay [1+ b (H) sin (Oot + Po), 


where | b (i) | < 1. 
If amplitude modulation is carried out according to the harmonic 
law b (t) = by cos Qt, where bọ = const and Q < Wy, then 


s = Ag (1 + bo cos Qt) sin (Wot -+ Qo). 


This modulated oscillation has a line frequency spectrum because 
it can be represented as the sum of three harmonic oscillations 
with the angular frequencies œp, ®) — Q and @) + Q, and 
amplitudes equal, respectively, to Ao, Agby/2 and Aybo/2. Thus 
Ay (1+ bo cos Qi) sin (yt + Po) 

Agba 
= Áo sin (Oot + fo) + 5 2 {sin [(0 +2) t+ Go] 
+ sin [(@)—Q) t+ Pol} 


In frequency modulation of the harmonic oscillation s = Ag X 
X sin (Mot + Po), carried out according to a harmonic law, the 
modulated oscillation is of the form 


s = Ay sin [@ (1 + bo cos QL) t+ Pol, 
where bọ < 1 and Q < ©. 


Accordingly, in phase modulation, the initial phase of the 
oscillation is changed. Thus 


s = Ag sin (Mi + Aq cos Qt), where Q < Wy. 


In the general case oscillations can be modulated simultancously 
in amplitude and in phase (or frequency). An example of such 
modulated oscillations is beating (28.4.4). 
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28.4.9 Addilion of mutually perpendicular harmonic oscillations 
of equal frequency. Assume that point M oscillates simultaneous- 
ly along the coordinate axes OX and OY according to the law: 
x = A, sin (Ot + @,) and y = A, sin (Wt + Po), where z and 
y are the Cartesian coordinates of point M. The path equation 
for the resultant motion of point M in plane XOY can be found 
A eliminating parameter ¢ from the expression for z and y. 
Thus 


xr? y? 2Y 


AP a aA (Pa — p1) = sin? (F2 — P1) 


This path is an ellipse (Fig. 28.10) and point M describes this 
ellipse during a time equal to the period of the oscillations being 


Fig. 28.40 


added together (T = 2n/). Hence, the resultant motion of 
point M is called elliptically polarized oscillation. 

The orientation of the axes of the ellipse in plane XOY, as well 
as its dimensions depend upon the amplitudes A, and A, of the 
oscillations being combined and on their initial phase difference 
Pa — Pı- IÉ Pa — P1 == (2m + 1) n/2, where m = 0, +1, +2,..., 
the axes of the ellipse coincide with the coordinate axes OX and 
OY, and the lengths of the semiaxes are equal to the amplitudes 
A, and A>. Thus 


r2 y? 


#1. 
A | AB 
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If, in addition, A, = Ag, the path of point M isa circle. Such 
resultant motion of point M is called circularly polarized oscil- 
lation. 

When og, — g, = mn, where m = 0, +1, +2,..., the ellipse 
degenerates into a straight line. Thus 


The plus sign corresponds to even values of m, i.e. to the addition 
of cophased oscillations (Fig. 28.11a); the minus sign, to odd 
values of m, i.e. to the addition of oscillations that are in 


(te 1a ayaa 


opposition (antiphased) (Fig. 28.116). In either case, point M 
executes linearly polarized oscillation. It oscillates with harmonic 
motion at the frequency of the oscillations being combined and 
with the amplitude A = y A? + 42 along a straight line that 
makes the angle a = arc tan (A, cos mm/A,) with the axis OX. 
28.4.10 In the addition of mutually perpendicular oscillations 
having the angular frequencies pœ and qœ, where p and q are 
whole numbers, we obtain 


x = A, sin (pot + Qı) and y = Ag sin (q@t + Pa). 


The values of coordinates x and y of the oscillating point M are 
simultaneously repeated after equal periods 7, of time, equal 
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to the least common multiple of 7, = 2n/p® and T, = 2n/go, 
which are the periods of oscillation along the OX and OY axes. 
Hence, the path of point M is a closed curve whose shape depends 
upon the ratios of the amplitudes, frequencies and initial 
phases of the oscillations being combined. Such closed paths of 
point M, which has simultaneous harmonic oscillations in two 
mutually perpendicular directions, are called Lissajous figures. 


Lissajous figures are inscribed into rectangles whose centre coin- 
cides with the origin of coordinates and whose sides are parallel 
to coordinate axes OX and OY. The distances of these sides from 
the axes are equal, respectively, to A and A,. The ratio of the 
frequencies q% and p® of the combined oscillations is equal to 
the ratio of the number of times the corresponding Lissajous 
figure touches a side of the rectangle parallel to the OX axis to 
the number of times it touches a side parallel to the OY axis. 
Shown in Fig. 28.12 are the Lissajous figures for three different 
values of the ratio q/p (2:1, 3:2 and 4:3) and for initial phase 
differences of Ag = @; — P = 1/2. 
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CHAPTER 29 DAMPED AND FORCED OSCILLATIONS 


29.1 Damped Oscillations 


29.1.1 The damping of an oscillation or vibration is its gradual 
weakening in the course of time due to the loss of energy of the 
oscillatory (vibratory) system. Free oscillations (vibrations) 
of real systems are always damped. The damping of free mecha- 
nical vibrations is due chiefly to friction and the excitation of 
clastic waves (30.1.3) in the surrounding medium. Damping in 
electrical oscillatory systems is due to thermal losses in the 
conductors making up the system or located in its variable 
electric field, energy losses on the radiation of electromagnetic 
waves (31.1.1), and heat losses in dielectrics and ferromagnetic 
materials as a result of electrical and magnetic hysteresis 
(18.4.5 and 26.5.2). 

The damping of oscillations depends upon the properties of 
the oscillatory system. A system is said to be linear if the para- 
meters characterizing physical properties of the system, essential 
in the process being considered, do not change in the course 
of the process. Linear systems are described by linear differential 
equations. A spring pendulum (28.2.3), for instance, moving in 
a viscous medium, is a linear system if the resistance coefficient 
of the medium and the elasticity of the spring are independent 
of the velocity and displacement of the pendulum. An electrical 
oscillatory circuit (28.3.1) can be considered linear if its elec- 
trical resistance R, capacitance C and inductance L are indepen- 
dent of the current in the circuit, and the voltage. In the majority 
of cases real oscillatory or vibratory systems are sufficiently 
close, with respect to their properties, to linear ones. 

29.1.2 The differential equation for the free damped oscillation of 
a linear system is of the form: 


d2s ds 
-+- 28 — + ogs = 
dt? H2 dt O0 n 


Here s is the physical characteristic of the system that varies 
in the oscillation, B = const œ> 0 is the damping factor, and 
© is the angular frequency of free undamped oscillation of the 
same system, i.e. in the absence of energy losses (at B = 0). 
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xample 1. Free damped oscillation of a spring pendulum (28.2.3). 
A pendulum of mass m, oscillating in a straight line along the OX 
axis due to the elastic force of its spring, is also subject to a 
resisting force Fres = —bv, where v is the velocity of the pen- 
dulum and b = const > 0 is the resistance coefficient. The differ- 
ential equation of free damped oscillation of the pendulum is 


dx dx dèx Ax 
eee hs SG ee te i, ene ee 2 ee 
m se b Ji kx or FTE + 28 T +ozr=0, 


where B = b/2m, © = V k/m, and k is a factor specifying the 
clastic properties of the spring. 

Example 2. Free damped oscillation in an electric oscillatory 
circuit. The electrical resistance of a real circuit R = 0, and the 


differential equation of the oscillation in the circuit (28.3.4) is 
of the form 


dq dq 
ae ag OSN 


where B = R/2L and © = 1/V LC. 


29.1.3 If damping is small (B < ©) the dependence of s on £ 
pene the equation of damped oscillation (29.1.2) is of 
the form 


s = Age Pt sin (wt + wo). 


Here © = y o2 — p? and the constant quantities A, and pọ 
depend upon the initial conditions, i.e. upon the values of 's 
and ds/dt at the initial instant of time (¢ = 0). The curve of 
the dependence of s on ¢ at p = 0 is shown in Fig. 29.1. ~ 
Damped oscillations are not periodical ones (28.1.2). For exam- 
ple, the maximum value of the varying quantity s, reached 
at a certain instant ¢, of time, is never repeated subsequently 
(at t > 7t,). But the quantity s becomes zero in damped oscil- 
lation as it varies in a single direction (decreases, for instance) 
and also reaches its maximum and minimum values after 
equal time intervals. Thus 


2 27 
o Vof 


25—0186 


T = 
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For this reason, the quantities 7 and © are conditionally called 
the period (conditional period) and angular frequency (conditional 
angular frequency) of damped oscillation. 

The quantity 


A = Aeh! 


is called the amplitude of damped oscillation, where Ag is the 
i nitial amplitude. The amplitude of damped oscillation de- 


Fig. 29.4 


creases in the course of time; the greater the damping factor f, 
the more rapidly the amplitude decreases. 

The time interval t = 1/B during which there is an e-fold de- 
crease in the amplitude of damped oscillation, is called the 
relaxation time. 

29.1.4 The logarithmic decrement of damping is the dimensionless 
quantity 5, equal to the natural logarithm of the ratio of the 
amplitudes of damped oscillation at the instants of time ¢ and 
t+ T (where T is the conditional period of oscillation). Thus 


where N is the number of oscillations in which the amplitude 
is reduced e-fold. 
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The relation between the angular frequency w of damped oscil- 
lation of a system and the logarithmic decrement of damping 
6 is 


o=a VY 1—-(2)" (=) 


29.1.5 The Q factor (or quality factor or storage factor) of an 
oscillatory system is the dimensionless quantity Q, equal to the 
product of 2x by the ratio of the energy of oscillation ŒE (t) of 
the system at the arbitrary time instant ¢ to the decrease of this 
energy during the length of time from ¢ to ¢-++ T, i.e. during 
one conditional period of damped oscillation. Thus 


= E (t) 
= 20 RF ELT)’ 
Since the energy E (t) is proportional to the square of the am- 
plitude A (t) of the oscillation, 
B A? (t) a 
C= en -EAF e BT Te 
At low values of the logarithmic decrement of damping 6, the 
Q factor of an oscillatory system is Q = 1/6. Here the con- 
(ditional period of damped oscillation 7 is practically equal to 
the period T, of free undamped oscillation and, consequently, 
Q = n/T = 0/28. For example, the Q factor of an electrical 
oscillatory circuit (28.3.1) is Q = V LICIR, and that of a 
spring pendulum (29.4.2) is Q = V km/b. 
29.1.6 The conditional period of damped oscillation increases 
with the damping factor B and becomes infinite at P = œp. If 
P > Op the differential equation of motion of the system 
d2s ds 
E NOG E eg 

TOE 2B gr T 28s 0 


has the following general solution: 
s= Cye Ba + Coe ae 


where a, = B+ Vp? — o}, ag = B — V B? — of, and C, 
and C, are constant factors depending upon the initial con- 


20 ¥ 


388 DAMPED AND FORCED OSCILLATIONS [29 


ditions. If the initial values (at the instant of time ż = 0) equal: 
s = sọ and ds/di = vo, then 


QoS V 1S V 
Gi des SO, E E ELN 


Such motion of the system is not of an oscillatory nature and 
is said to be aperiodic. Depending upon the initial conditions, 


S 


So 


Fig. 29.2 


aperiodic motion of the system may be either of two kinds 
(Fig. 29.2). Type a motion occurs when sọ and v are opposite 
in sign and | vo | > & | sọ |. Otherwise, the aperiodic motion 
is of the b type 


29.2 Forced Mechanical Vibration 


29.2.1 A variable external force, applied to a system and causing 
its forced mechanical vibration, is called the disturbing, or 
driving, force. 

The differential equation of forced vibration (or oscillation) of the 
simplest linear system (29.1.1), the spring pendulum (29.1.2), 
executed along the OX axis due to the action of the variable 
external force F (t) is: 


d?r dx 4 
ga CG ets Fe (). 
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If F,. (t) is a periodic function of time, then, after application 
of the force to the pendulum, the forced vibration begins with a 
fransient condition. The pendulum simultaneously participates 
in two vibrations : 


x = zı (t) + z (t). 


The first term corresponds to free damped oscillation* of the 
pendulum n 1.3): 


(2) = Age™ sin (t + yy), 


where © = V og — f?. 

The second term corresponds to the undamped periodic oscil- 
lation of the pendulum at a frequency equal to that of the 
driving force F, (t). 

‘he amplitude value of xı (t), equal to A, exp (—f2), is reduced 
al a more or less rapid rate after the beginning of the forced 
vibration (or oscillation). During the time t = 4.6/B the am- 
plitude of x, (t) decreases by a factor of 100. Consequently, after 
a certain time t following the beginning of vibration (t ZS To), 
tree vibration of the pendulum practical ily ceases: x (t) % z (t). 
The pendulum reaches a state of steady forced vibration (or oscil- 
lation) at the frequency of the driving force. 

29.2.2 When the driving force varies according to a harmonic 
law, i.e. Fy = Fo cos Qt, the steady-state forced vibration of 
the pendulum is also harmonic at the same frequency. Thus 


t= A cos (Qt + Qo). 
The amplitude A of this vibration and the phase shift @) between 
(he displacement and the driving force depend upon the relation 


between the angular frequencies of the forced vibration (Q) and 
(he free undamped vibration (œo) of the pendulum. Thus 


ro and tan o= — ape 


A. —— a er 6 
m V (o — 2)? + 422? oo ne 


* It is assumed that B < ©. Otherwise, the free motion of the 
pendulum would be aperiodic (29.1.6), i.e. 


r, (t) = C exp [—(B+ V B?— o2 ) t] 
| Cy, exp [—(B— V R= 02) tl. 
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At Q = 0 we obtain pọ (0) = 0 and A (0) = Ay = Fo/mo? = 
= F,/k, which is the static displacement of the pendulum from 
A 
| 
{,=0 
OS Bo <Bs <P, 
\ 
Ag oS 
| 
0 Wo R 


Fig. 29.3 


its equilibrium position by the constant force F, = Fy. At 
Q —> œ, the amplitude A (Q) > 0 and tan q, —> 0, whereas 


Po — —xn. The curves of the functions A (Q) and ọọ (Q) are 
shown in Figs. 29.3 and 29.4 for various values of the damping 
factor P. 
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29.2.3 The amplitude of the displacement in the case of steady- 
state forced harmonic vibration (oscillation) of the pendulum 
reaches its maximum value at the angular frequency 


Qr= V of — 28" = V oF — BR, 


where © is the angular frequency of free damped vibration of 
the pendulum (29.1.3). The frequency Q, is called the resonance 


frequency. 

The maximum amplitude is 
F mF 

Amax =A (Rr) = = > 


2mBo méw? ' 


where 6 is the logarithmic decrement of damping (29.1.4). 
When B < Op, Q, © Oo, Po (Q) =% —n/2 and Amax S QAo, 
where Q ~ n/ô is the Q factor of the pendulum (29.1.5) and 
A, is the static displacement (29.2.2). 

The drastic increase in the amplitude of forced mechanical 
vibration (or oscillation) as the angular frequency of the driving 
force approaches the value Q, is the phenomenon called mechan- 
ical resonance. Accordingly, the curves of the dependence of A 
on 2, shown in Fig. 29.3 are called resonance curves. 

As the damping factor B is increased, the peaks on the resonance 
curves are rapidly reduced (at small B values the amplitude 
Amax ~ 1/6) and the resonance frequency Q, is slowly de- 
creased. 

29.2.4 The velocity of the pendulum in steady-state forced har- 
monic vibration (oscillation) is 


dz 
dt 
Here Ay = AQ and a = pọ + 7/2 are the amplitude of the 


velocity and the phase difference between the velocity and the 
driving force: 


= — AQ sin (Qt + Mp) = Ay cos (Q¢+ a). 


Vy = 


FQ Fo 
mem V o? Q2)? -]-4R2Q2 = “(@2—222 
m Me — -}- W — 
i m V ako pae 
and 
»2— 02 
tan œ = — cot Gis Oe , 


2p2 
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The amplitude of the velocity is maximal at Q = ©, and 
equals 


F 
(Ay)max=Ay (Wo) = ome i 


In this case, a = 0, i.e. the velocity of the pendulum oscillates 
in phase with the driving force. As Q — oo, the amplitude 
Ay — 0 and a —> —n/2, whereas as Q — 0, the amplitude 
Ay — 0 and a>vn/2. 

29.2.5 The acceleration of the pendulum in steady-state forced 
harmonic vibration (oscillation) is 


d?r 


=e — AQ? cos (Q¢-++ Qo) = Aa COs (Qt 4- y). 


Ax 


Here A, = AQ? and y = Po + n are the amplitude of the 
acceleration and the phase difference between the acceleration 
and the driving force: 


a oe o a 
m V (oR—Q?)?-+ 4822? m V [(wo/Q)?— 1]? + (28/2) 


The amplitude of acceleration is maximal at 


= = V oP. 


Oo. 
V 1+ (B/o,)? 


At Q = 0, the amplitude A, = 0, whereas as 2 -» oo the 
ene of the acceleration tends to the value A, (œ) = 
= Fom. 

29.2.6 In steady-state forced vibration, the energy loss of the 
vibratory system, due to dissipative forces (1.3.7), is completely 
compensated by the work performed on the system by the driving 
force. The work, for example, done in one complete oscillation 
by the force of resistance acting on the spring pendulum is 


T L 


` 2, pP 

W= —b | (F) dt = —2mßA2Q2 | sin? (Qt + Qo) dt 
dt ) 

(0) 


Aa 


0 
= — mB A?Q?, 
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where b = 2m is the resistance coefficient (29.1.2). 
The work done during the same time by the driving force F, = 
= Fo cos Qt is 


T T 

W, = Fo \ = cos Ot dt= — AQF \ cos Qt sin (Qt Qo) dt 
0 0 

= + AQ Fo sin Pp = —W,, 

since sing, = —2mBAQ/F5. 


29.2.7 If the driving force acting on the spring pendulum varies 
periodically, but not according to a harmonic law, it can he 
represented as the sum of the harmonics of this force (28.4.6). 
These have different amplitudes, initial phases and angular 
frequencies that are multiples of Q = 2n/T, where T is the 
period of variation of the driving force. Since the pendulum is 
a linear vibratory (or oscillatory) system (29.1.1), each har- 
monic of the driving force acts on the system as if there were 
no other harmonics. Therefore, the steady-state forced oscil- 
lation of the pendulum, caused by an arbitrary periodic driving 
force, can be dealt with as the result of superposing steady-state 
forced oscillations due to each harmonic component of the 
driving force acting separately. The “contribution” of the various 
harmonic components to the resultant oscillation depends upon 
their frequency and amplitude. Owing to resonance phenomena, 
an essential role.is played by only the harmonic components of 
the driving force that have an angular frequency close to the 
resonance frequency of the pendulum (29.2.3). If the damping 
factor B of the pendulum is small, the pendulum can execute 
steady-state forced oscillation close to the harmonic one, even 
if the driving force is far from harmonic. An example of such 
forced oscillation is that of a pendulum subject to short periodi- 
cal external action in the form of impacts in a single direction 
and repeated over equal lengths of time, which are equal to the 
period of free oscillation of the pendulum. 


29.3 Forced Electrical Oscillation 


29.3.4 To obtain forced oscillation in an electrical oscillatory 
circuit (28.3.1) it is necessary to connect into it a source of 
electrical energy, whose emf 6 varies with time (Fig. 29.5). 
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In electrical engineering a source of electrical energy, charac- 
terized by its emf and internal electrical resistance, is called a 
seat, or source, of emf (voltage source). According to Ohm’s law 
for the portion J-R-L-2 (21.2.3), the quasi-stationary current 
(28.3.1), produced in the cir- 
T>0 R cuit in forced oscillation, can 

be found from the formula 


IR=V,—V,—L_+6 (0). 


AN, 7 
g] £ | 
Here Va — V, = q/C is the po- 
tential difference across the ca- 
a b pacitor plates, g is the capacitor 
E(t ) charge, and the internal elec- 
PAs trical resistance of the source 
Fig. 29.5 of emf is assumed to be neg- 


ligibly small compared to R 
(such a source of emf is said 
to be ideal). From the law of conservation of electrical charge 
(14.1.3) it follows that J = dq/dt. Hence, the differential equa- 
tion of forced electrical oscillation in a circuit can be presented 
in a form similar to the equation for forced mechanical vibra- 
tion (29.2.1): 
d*q dq ts 
dt? +2P -z +- weg a Te (t). 


Here p = R/2L is the damping factor for free oscillation in a 
circuit, and ©, = 1/V LC is the angular frequency of free 
undamped oscillation (i.e. at R = 0). 

29.3.2 If the driving emf © (t) varies according to the harmonic 
law € (t) = €, cos Qt, then at steady-state forced oscillation 
(29.2.1), the capacitor charge varies harmonically at the same 
angular frequency Q: 


q = q Cos (Qt + Po). 


The amplitude gq) and the initial phase ọọ are determined by 
the formulas 


€, _ E> 


qo = Ce ee 
Q yV Rt (QL—1/2C)? L V (@3— Q2)2-|- 4B2Q2 
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and 

tan Gp = ee Seen 
Po = “OE — 1/2C og — QF * 

At Q = 0, the phase gy (0) = 0 and gq, (0) = EC, which is the 

charge of the capacitor at a constant potential difference between 

the plates equal to €,. At Q — oo, the amplitude g, — 0, and 

Po > —x. The curves showing the dependence of Qo on Q are 

shown in Fig. 29.4, and those of the dependence of qo on Q, in 

Fig. 29.3, where A = q and Ay = qo (0) = €)C. 

29.3.3 With steady-state forced oscillation in the circuit, the 

current is 


== —— == — qo sin (Q¢-+ Po) = Lo cos (QE— ọ). 
The amplitude of the current J, = qQ and the initial phase 
—q = Po + 7/2 are determined by the formulas: 


Eo QL —1/2C 


| PE E EEE NEE 
°? V RFRQL—IRCE R 


and tan ọ= 


Curves showing the dependence of J, on Q for various values of 
R are called resonance curves of the oscillatory circuit and are 


Ly 


Fig. 29.6 Fig. 29.7 


shown in Fig. 29.6; curves of the dependence of ọ on 2 are shown 
in Fig. 29.7. The resonance angular frequency &,, corresponding 
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to the maximum amplitude of the current in the circuit in forced 
oscillation, is independent of R. Thus 


1 

V IC 

The current amplitude at resonance is J, (Q) = €,/R, and 
the phase difference between the current and emf is @ (Q,) = 0. 
When Q < p, ọ < 0, i.e. the current leads the emf in phase; 
the lower the frequency ©, the greater this phase lead (p = —m/2 
at Q = 0). When Q > ©, ọ > 0, i.e. the current lags in phase 
behind the emf; the higher the frequency Q, the greater this 
phase lag (p > n/2 as Q —> ov). 

29.3.4 The potential difference across the terminals a and b 


of an ideal source of harmonic emf (Fig. 29.5) is equal to its 
emf. Thus 


u = Va — Vo = 6p Cos Qt. 


Qr = Wo = 


The potential drops across various portions of the circuit ile 
lustrated in Fig. 29.5 and having the variable sinusoidal (har- 
monic) current J = J, cos (Qt — ọ), a capacitor of capacitance 
a of resistance R and a coil of inductance L are as 
ollows: 


uc=V;—V; = =U coos (or—e—+} 


ur = I R = Up cos (Qt—9Q), 


u= L f =U, cos (Q2—@ +- >) , 


The variation of up, istin phase with that of current J in the 
circuit, wz, leads the current in phase by 71/2, and uç lags behind 
the current in phase by x/2. Here 


uc + Up + ur = u = 6p Cos Qt. 


29.3.5 The amplitude values of uc, uz and up are equal, ree 
spectively, to 


Uc = XcI» Ur = Xl, and Up = RIp, 
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where Xç = 1/QC is the capacitive reactance of the circuit, and 
Xz, = QL is the inductive reactance of the circuit. The quantity 
X =X, — Xo = QL — 1/QC is called the reactance of the 
circuit, R is called the resistance and Z = V R?-+ (QL — 1/QC)? 
is called its impedance. 

The formulas in Sect. 29.3.3 for the amplitude of the sinusoidal 
current in the circuit and its initial phase can be rewritten in 
the form: J, = 6)/Z and tang = X/R, with cosg = R/Z 
and sing = X/Z. E 

At resonance Q = 1/y LC and Xz = Xç, so that the reactance 
of the circuit becomes zero and the impedance reaches its 
minimum value, which is equal to the resistance R. Thus 


X (Q,) = 0 and Z (Q,) = Zmin = R. 
In this case Up = 6) and Uc = UL = V LICE JR. 
29.3.6 The effective value of a periodic current (and, accordingly, 


the emf, voltage, etc.) is the root-mean-square value of the 
current during the time T of its variation. Thus 


1 co 
Ioe = T \ I? dt. 
0 


For a sinusoidai current and a sinusoidal emf 


An element of work done by sinusoidal current during the short 
time dt in the circuit illustrated in Fig. 29.5 is equal to 


SW = Iu dt = I6. cos (Qt — @) cos Qt dt. 


The instantaneous power of the current in the circuit is 


Ping= SUA = Iu = I ¿6o cos (t — p) cos Qt. 
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The average value of the instantaneous power during a period 
is called the active, or true, power P of the current in the electric 
circuit. Thus 


1 1 
P = F \ Pins da Iob COS P= Iettb etr COS Q. 


0 
The factor cos ọ is called the power factor. Since Iert = 6 o;/Z 
and cos ọ = R/Z, 


R 
Bor ° 


P = Fr 


At resonance Z = R and the active power reaches its maximum 
value 


CHAPTER 30 ELASTIC WAVES 


30.1 Longitudinal and Transverse Waves in an 
Elastic Medium 


30.1.1 A body is said to be elastic and its deformation, due to 
the application of external forces, is elastic deformation if it 
completely disappears after the external forces are removed. 
According to Hooke’s law elastic deformation is directly pro- 
portional to the external effects that cause it, i.e. it has a linear 
dependence on these effects. At sufficiently small deformation, 
all bodies can be assumed, practically, to be elastic. 

The elastic properties of bodies depend upon the nature of the 
thermal motion of their molecules and their forces of interaction. 
For example, a gaseous body changes its shape without hind- 
rance to suit the shape of the vessel it is contained in; gas has no 
elasticity of shape. But a gas has elasticity of volume, i.e. the 
capacity to resist changes in its volume. This property of a gas 
is due to the thermal motion of its molecules and is manifested 
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in the change in the pressure p of the gas when its volume V is 
changed. According to Hooke’s law for volume deformation, the 
change dp in the pressure of a a for a small change dV in its 
volume is directly proportional to the relative volume defor- 
mation, called the volume strain. Thus 


dp=—K 5 3 


where K is called the bulk modulus of elasticity of the gas. For 
an ideal gas (8.4.1) the value of K depends upon the kind of 
thermodynamic process of compression (or expansion) being 
carried out. At a very slow change in the volume of the gas, the 
process can be considered isothermal (8.3.7), whereas at a rapid 
change it can be considered adiabatic (8.3.7). In the former case 
K = p and in the latter K = xp, where x is the adiabatic 
exponent (9.5.11). 

30.1.2 The elasticity of a crystalline solid is due to the forces 
of mutual attraction and repulsion of its particles (ions, atoms 
or molecules) that make up the body and that have random ther- 
mal vibration about the points of its crystal lattice. The forces 
of interaction of the particle impede deformation of the crystal 
lattice that is assocíated with both the volume and shape of the 
body. Therefore, in addition to their bulk elasticity, solids also 
have elasticity of form, which is manifested by their resistance 
to shear deformation (40.3.9). 

The elasticity of liquids is also due to the forces of intermolecular 
interaction. But, owing to the fact that the average time t that 
molecules of a liquid remain in a temporarily settled state 
(13.2.4) is very short, liquids, like gases, possess only bulk 
elasticity. They manifest elasticity of form only with respect 
to variable deformation of ultrahigh frequency with a period 
that is less than or of the order of q. 

30.1.3 Elastic waves are mechanical disturbances (deformations) 
propagated in an elastic medium. Bodies that act on the medium 
and cause these disturbances are called sources of the waves. 
A member of the audience in a theatre, for instance hears the 
speech and singing of the actors and the sounds of the musical 
instruments due to the variations in the pressure of the air, 
reaching the audience and caused by the sources of sound. 
Sonic, or acoustic, waves are elastic waves of low intensity, i.e. 
weak mechanical disturbances propagated in an elastic medium. 
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Sonic waves, acting on the organs of hearing of people, are 
capable of producing auditory sensations if the frequency v of 
their corresponding vibrations lies in the range from 16 to 
2 X 104 Hz (audible sound). Elastic waves with a frequency 
y < 16 Hz are said to be infrasonic, whereas those with a fre- 
quency v> 2 X 10+ Hz are ultrasonic (elastic waves with 
y > 10° Hz are often said to be hypersonic). 

30.1.4 The propagation of elastic waves in a medium does not 
involve the transport of matter*. In a boundless medium, it 
consists in exciting forced vibrations of parts of the medium 
that are farther and farther away from the source of the waves. 
It is feasible here to digress from the discrete (molecular) struc- 
ture of the medium, dealing with it as a continuous medium, 
distributed continuously in space and possessing definite elastic 
properties. A particle of such a medium, having forced vibration, 
is understood to be a small element of its volume whose size, 
however, is many times greater than the intermolecular distances 
so that it contains an enormous number of molecules. Practical- 
ly, such a particle can be dealt with as a point because even in 
a gas the intermolecular distances are extremely small (of the 
order of 10-8 m under standard conditions). 

30.1.5 An elastic wave is said to be longitudinal if the particles 
of the medium vibrate in the direction of wave propagation. 
Longitudinal waves are associated with bulk deformation of the 
elastic medium and can therefore propagate in any kind of 
medium: solid, liquid or gaseous. An example of such waves 
are the sonic waves in air. 

An elastic wave is said to be transverse if the particles of the 
medium vibrate, remaining in planes perpendicular to the direc- 
tion of wave propagation. Transverse waves are associated with 
shear deformation of the elastic medium, and, consequently, 
can be generated and propagated only in media having elasticity 
of shape, i.e. in solids. An example of transverse waves are 
those propagated along the strings of musical instruments. 
pean properties are found in surface waves, which propagate 
along the free surfaces of liquids (or the interface between two 
immiscible liquids), being a perturbance of the surface due to 


* A certain transport of matter (mass transfer) may occur in the 
propagation in a medium of strong disturbances (for example, 


shock waves in explosions), when the vibration of the particles 
of the medium becomes nonlinear. 
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external effects (dropping of bodies into the liquid, the motion 
of ships, the wind, etc.). The forces of gravity and surface tension 
play a decisive role in the generation and propagation of these 
waves. In surface waves the particles of liquid have transverse 
and longitudinal vibrations simultaneously, describing elliptical 
or more complex paths. 

30.1.6 A medium is said to be uniform if its physical properties, 
essential in the problem being considered, do not change from 
point to point. A medium that is uniform with respect to cer- 
tain physical properties may be nouniform with respect to others. 
A monocrystalline body, for example, is uniform with respect 
to its elastic properties and, at the same time, optically non- 
uniform for X-rays. 

A medium is isotropic if its physical properties that are essential 
in the problem being considered are the same in all directions. 
A medium that is isotropic with respect to certain physical 
properties may be anisotropic with respect to others. Cubic 
crystals, for instance, are optically isotropic, but are anisotropic 
with respect to other properties. Gases and liquids are isotropic 
with respect to any physical properties. 

30.1.7 A medium is said to be linear if there is a directly pro- 
portional relation between the quantities characterizing the 
external effects being considered that act on the medium and 
the changes they lead to in the state of the medium. An elastic 
medium, for example, obeying Hooke’s law (30.1.1) is linear 
with respect to its mechanical properties. A dielectric is a linear 
medium with respect to its electrical properties if its dielectric 
constant (14.2.4) is independent of the electric field strength. 
By analogy, a magnetic material is a linear medium with re- 
spect to its magnetic properties if its relative magnetic per- 
TET (26.4.5) is independent of the magnetic induction of 
the field. 


30.2 Travelling Wave Equation 


30.2.1 Travelling waves are ones which, in contrast to standing 
waves (30.5.4), transport energy from one part of space to 
another. 

The travelling wave equation is the dependence on the coordinates 
and time of the scalar or vector quantities that characterize the 
vibration of the medium as the wave being considered is prop- 
agated through it. For waves in a solid medium, for instance, 


26—0186 
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such a quantity may be the vector of displacement of the par- 
ticles of the medium (30.1.4) from their equilibrium position or 
the three projections of the vector on the coordinate axes. Longi- 
tudinal waves in a gas or liquid are usually characterized by the 
surplus pressure of the vibrating medium, equal to the differ- 
ence between its variable and equilibrium pressures. 

A ray is a curve whose tangent at each point coincides with the 
direction of propagation of a wave, i.e. with the direction of 
energy transport by the wave (30.3.5). In a uniform medium 
(30.1.6) rays are straight lines. 

30.2.2 An elastic wave is said to be sinusoidal, or harmonic, if 
the corresponding vibration of particles of the medium is har- 
monic (28.1.3). The frequency of this vibration is called the 
wave frequency. Variations in pressure in a gaseous or liquid 
medium through which a sinusoidal wave propagates also take 
place according to a harmonic law at a frequency equal to the 
wave frequency. In a transverse sinusoidal wave, particles of the 
medium can simultaneously vibrate with harmonic motion at 
the wave frequency in two mutually perpendicular directions, 
each of which is perpendicular to the direction of wave prop- 
agation. Depending upon the kind of polarization of the resultant 
vibration (28.4.9), distinction is made between the following 
types of polarization of transverse sinusoidal waves: elliptical, 
circular and linear (or plane). 

30.2.3 A mechanical disturbance (deformation) propagates in 
an elastic medium at the finite velocity v. Therefore, distur- 
bances, initiated by the wave source at the instant tọ of time, 
reach an arbitrary point M of the medium at the instant t > tọ. 
The greater the path J that the wave travels from its source to 
point M, the greater the difference t — tọ = I/v. Consequently, 
the vibration at point M lags in phase behind that at the source 
of the waves. | 
The wave surface, or wavefront, is the locus of all the points at 
which the phase of the vibration has the same value at the 
instant of time being considered. The difference ¢ — tọ is the 
same for all points in a wavefront. Through each point of a 
medium subject to wave motion one wavefront can be passed 
that corresponds to the value of the phase of the vibration at 
this point at the instant of time being considered. Many different 
values of the phase of a vibration correspond to a family of 
wavefronts. In a uniform isotropic medium (30.1.6) the wave- 
fronts are orthogonal to the rays (30.2.1). 
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30.2.4 A wave is said to be plane if its wavefronts constitute 
i system of parallel planes. In a plane wave, propagating 
ulong the OX axis, all values of s, characterizing the vibratory 
motion of the medium, depend only upon the time ¢ and the 
coordinate z of the point M being considered in the medium. 
If there is no absorption of waves in the medium (30.3.7), the 
vibration at point M differs from that at the origin of coordinates 
© only in that it is shifted in time by the amount z/v, where v is 
the wave velocity. Hence, in a plane wave, propagating along 
the positive direction of the OX axis, s is a function of the differ- 
ence (t — z/v) and the equation of such a plane wave is 


10-4). 


Accordingly, the equation for a plane wave propagating in the 
opposite direction is 


pf € -4 =| 


30.2.5 The equation of a plane sinusoidal, or simply sine, wave, 
propagating in a monabsorbing medium along the positive 
direction of the OX axis, is 


j Vv 


s <A sin E (e-=] +0 |=4 sin ( ot— > =+90), 


or 


where A = const is the amplitude of vibration and is called the 
wave amplitude, © = 2n/T is the angular frequency of the wave, 
1’ is the period of vibration, and @, is the initial phase of the 
vibration (at the instant of time ¢ = 0) in points of the coordi- 
nate plane z = 0. The quantity D = @t — wz/v + Po, equal to 
the phase of the vibration at an arbitrary point with the coordi- 
nate z, is called the phase of the plane wave (see also Sect. 30. 2.6). 
30.2.6 The distance à = vT that a sine wave travels during the 
time equal to the period of vibration is called the wavelength. 
The wavelength is equal to the distance between two closest 


20k 
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pomi in a medium in which the phase difference of the vibration 
equals 2x. 

Along with the wavelength, use is made of another characteris- 
tic of a sine wave. It is the wavenumber 


pot 27 oO 
= KO eT v’ 


Hence, the equation of a plane sine wave (30.2.5) can be written 
in the form 


s= A sin (o —- at o) = Á sin (ot— kz + po). 


Accordingly, the phase of this plane wave is D = wt — kx + Qo. 
30.2.7 The wave vector is the vector k, equal in magnitude to the 
wavenumber k and having the direction along the ray (30.2.1) 
at the point M being considered in the medium. The wave vector 
of a plane sine wave does not depend on the choice of point M, 
and the equation for such a wave can be written in the form 


s = Á sin (wit — kr + aq), 


where r is the radius vector of point M, and a is the initial 
phase of vibration at the origin of coordinates, i.e. at the point 
r= 0. 

On the basis of the Euler formula (28.1.7), the equation of a plane 
sine wave can be written in exponential form, which is con- 
venient for differentiation: 


~~! . t- ô 
s — Aet kr+ ). 


where ô = a — x/2. Only the real part of the complex quantity 
‘sis of physical meaning, i.e. the quantity s = Re fs}. Therefore, 


in making use of s to determine some characteristic of a wave, 
it is necessary to discard the imaginary part of the final complex 
expression. 

30.2.8 A wave is said to be spherical if its wavefronts are con- 
centric spheres. The centre of these spheres is called the wave 
centre. Such waves are generated in a uniform isotropic medium 
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by a solitary point source. The equation of a diverging spherical 
wave is of the form 


tsk), 


where r is the distance from the wave centre to the point M 
being considered in the medium, and v is the wave velocity. 
ln the case of a spherical sine wave 


a . 
= sin (wi— kr--a), 


where ao/r = A (r) is the wave amplitude (30.3.6), a) is a 
physical quantity, numerically equal to the wave amplitude at 
unit distance from the wave centre, œ is the initial phase of 
vibration at the wave centre, and ® = wi — kr + a is the 
phase of the spherical wave. 

In the exponential form, the equation of a spherical sine wave is 


20 pi(@t—kr+5)__ 70 ei(@t—hr+ 6). 


r r 


where 6 = a — 2/2, fis the radius vector from the wave centre 
to the point M being considered, and wave vector k at point M 
ix in the direction radially away from the wave centre. 

teal sources of waves are always of finite size. But they can be 
dealt with as point sources and the waves they generate in a 
uniform isotropic medium as spherical ones if the distance r 
from the source to the points being considered in the medium is 
considerably greater than the dimensions of the source. When r 
ıs extremely great, any small portions of a wavefront can be 
revarded as being plane surfaces. 

‘0.2.9 The propagation of waves in a uniform isotropic medium 
van be described by a partial differential equation, called the 
wave equation. Thus 


02s 02s 1 @%s 1 os 
ay? a a ee oe T «ASS or or: 


d*s 
dx? 
llere s is a physical quantity that characterizes a disturbance 


propagating in the medium at the velocity v, A = 02/dz2 + 
| 0*/dy* + 6?/d22 is the Laplace operator, or Laplacian. This 
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equation is satisfied, among others, by a plane wave (30.2.7) 
and a diverging spherical wave (30.2.8). 

Function s, characterizing a sine wave in a uniform isotropic 
medium, satisfies two partial differential equations at the 
same time: 


As = —k?s, 

where k is the wavenumber (30.2.6), and 
02s 

——_ $a ——. 2 

TE w2s, 


where © is the angular frequency of the wave.. 

30.2.10 The velocity v of propagation of a sine wave is called 
the phase velocity. It is equal to the velocity of displacement in 
space of the points in a surface that corresponds to any fixed 
value of the phase of a sine wave. For example, in the case of a 
plane sine wave (30.2.6) it follows from the condition œt — kz + 
+ Po = const that dz/dt = o/k = v. In the same way, in the 
case of a spherical sine wave (30.2.8), it follows from the con- 
dition ot — kr + a= const that dr/dt = w/k = v. 


30.3 Phase Velocity and Energy of Elastic Waves 


30.3.1 The phase velocity of sonic waves (velocity of sound) in a 
liquid or gas is 


„=y Z 
p’ 


where p is the density of the undisturbed medium, and K is 
the bulk modulus of elasticity of the medium (30.1.1). The 
frequency of audible sonic waves v > 16 Hz, and the process 
of deforming the medium can be assumed to be adiabatic (8.3.7), 
i.e. K = —V (dp/dV)aq. The relation between the pressure p 
and volume V for an ideal gas (8.4.1) in an adiabatic process is: 


pV” = const, where x is the adiabatic exponent (9.5.11), so that 
K = xp. Therefore, the velocity of sound in an ideal gas is equal 


to 
— Key “RT 
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where p is the molar mass of the gas, T is its absolute tempera- 
lure, and R is the universal gas constant. 

30.3.2 The phase velocity of transverse elastic waves in a uniform 
isotropic solid medium is 


f= 
R == p? 


where G is the shear modulus of the medium (40.3.9) and 0 is 
its density. 

The propagation of longitudinal waves in a thin long rod is 
associated with its longitudinal tension and compression. 
Ifence, the phase velocity of such waves is 


E 
=V a 


where E is Young’s modulus (40.3.6) for the rod material. 
The velocity of propagation of transverse waves along a stretched 
string (a thin flexible cord) is 


os vV F j 
— TT 
where F is the stretching force (tension) in the string, and p and 
S are the density of the string material and the cross-sectional 
area of the string. 
30.3.3 An elastic medium in which mechanical waves propagate 
has kinetic energy of vibratory motion of its particles and 
potential energy due to deformation (30.1.1). If v, is the velocity 


of particles of the medium, then the volume kinetic energy density 
of the medium is 


where p is the density of the medium, and dE, is the kinetic 
energy of all the particles in the small volume dV of the medium. 
This volume has been selected so that throughout it the velocity 
vı is the same. | 
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The volume potential energy density of an elastically deformed 
medium is 


where dEp is the potential energy of a uniformly deformed 


small portion of the medium of volume dV, v is the phase ve- 
locity in the medium, and s is the relative deformation. : 
The volume energy density of elastic waves is understood to be 
the volume density £4 of the mechanical energy of a medium due 
to the propagation of these waves. Thus 


Ea = EÈ- KA (vi + ve). 


30.3.4 If a longitudinal plane travelling wave (30.2.4) prop- 
agates in a medium, then vı = 0s/dt, where s is the displace- 


ment of the particles, and e = 0s/dxr = —v,/v, so that 
E1--E4 and Et=pyt—p (2 ° 


At each point of the medium subject to wave motion, Ee and 


E® are alike functions of time. Consequently, £4 also varies 


with time. This principle is valid for any travelling waves in 
an elastic medium, regardless of the shape of their wavefronts 
and the type of deformation of the medium. It follows from the 
law of conservation of energy as applied to the process of the 
propagation of vibrations in an elastic medium. To set regions 
of the medium more and more distant from the wave source 
into vibratory motion it is necessary to expend energy, which 
is supplied to the medium by the source. Consequently, the 
propagation of elastic waves is inseparably associated with the 
transport of energy from certain parts of the medium to others. 
This is precisely why the volume energy density £4 of the waves 
depends both on the coordinates and on the time as well. 
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l'or a plane travelling sine wave in a nonabsorbing medium 
(30.2.5) 


Id = 9 A*a@? cos? (at — kx + Qo) 


=> 0420? [1+ cos 2 (@t— kz + Qo)], 


where A = const is the wave amplitude. 
In the case of a diverging spherical sine wave in a nonabsorbing 
medium (30.2.8) 


kd = pA??? cos? (@t — kr + a), 


where A = a/r is the amplitude of 
(he wave. 

The average volume energy density 
over one period is 


(Ea) = . 04202. 


Fig. 30.1 


30.3.5 The velocity of energy transfer 

by a wave is equal to the velocity in 

space of a surface corresponding to the maximum value of 
the volume density #4 of the wave’s energy. For sine waves 
this velocity is equal to the phase velocity v. 

The energy flux dD, through the small surface dS is the ratio 
of the energy dE transferred through this surface in a short 
time interval to the magnitude of this interval dt. Thus 


dD pe = dE/dt. 


If v is the velocity vector of energy transfer by the wave 
(ig. 30.4), then 


dE = E4y dt dS cosa = Eå (v dS) dt 
and 
dp = Ed (v dS) = (U dS), 


where £4 is the volume energy density of the wave, dS = n dS 
is the vector of surface dS, n is a unit vector of the normal to 
Lhe surface, and æ is the angle between v and dS, 
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The vector U = Edv, having the direction of energy transfer 
by the wave, is called the Umov vector (energy flux density vector 
of the wave). In magnitude it equals the ratio of the energy flux 
dp through the small surface dS to the area dS, = dS cosa 


of the projection of this surface on a plane perpendicular to the 
direction of energy; transfer: U = dOg/dS,. 


30.3.6 The wave intensity I is the magnitude of the average value 
of the Umov vector. Wave intensity is numerically equal to 
the energy transferred by the wave in unit time through unit 
area of a surface perpendicular to the direction of wave prop- 
agation. The intensity of a travelling sine wave is proportional 
to the square of its amplitude. For plane and spherical sine 
waves 


I= |(U)| =v (Eð) = > eve? A?. 


When a spherical wave propagates in a nonabsorbing medium, 
the same amount of energy is transferred in unit time through 
any spherical surface of radius r with its centre at the wave 
centre. This energy is equal to that expended during the same 
time by the wave source: 4nr?I = const. Hence, the intensity 
and amplitude of a spherical wave decrease with an increase 
in the distance from the wave centre according to the relations 


I()=—t and A=, 


where i, and a, are physical quantities numerically equal to the 
intensity and amplitude of the wave at the distance r = 1 m 
from the wave centre. : 
It can be shown in the same way that in the case of a plane sine 
wave in a nonabsorbing medium, the amplitude A of the wave 
is independent of the coordinates. 

30.3.7 The conversion of wave energy into other forms of energy, 
taking place in the propagation of waves through a medium, is 
called wave absorption. In a uniform medium the absorption of 
elastic waves is due chiefly to the processes of internal friction, 
or viscosity (10.8.4), and heat conduction (10.8.5). The amplitude 
A and intensity J of a plane wave, propagating in an absorbing 
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medium along the ae direction of axis OX, varies according 
to the exponential law 


A(a)=Age7™* and I (x)= Iye7 2%*. 


Here Ay and J, are the amplitude and intensity of the wave at 
the points z = 0, and a is the linear absorption coefficient for 
elastic waves and depends upon the properties of the medium and 
the wave frequency. 

30.3.8 The dispersion of waves is the dependence of the phase 
velocity of sine waves in a medium on their frequency. A medium 
in which this phenomenon is observed is said to be a dispersive 
medium. The dispersion of acoustic waves in a boundless me- 
dium depends upon the properties of the medium and is always 
accompanied by the absorption of sound. 


30.4 Principle of Superposition of Waves. Group 
Velocity 


30.4.1 The principle of superposition of waves states that in a 
linear medium (30.1.7) waves are propagated independently of 
one another, so that the resultant disturbance at any point in 
the medium upon the simultaneous propagation of several waves 
is equal to the sum of the disturbances corresponding to each 
of the waves separately. 

For example, if n different mechanical waves are propagated 
simultaneously through a linear medium the resultant displace- 
ment s, velocity v and acceleration a of the particles of the 
medium at the arbitrary instant ¢ of time are 


n n 
s= \) Sj; v= >) V; and a=») aj, 
i=l i=1 i=1 


where s;, v; and a; are the values of the displacement, velocity 
and acceleration that the particles being considered would have 
at the instant of time ¢ if only the ith wave was propagated in 
the medium. 

30.4.2 On the basis of the principle of wave superposition and 
a Fourier expansion (28.4.6 and 28.4.7) any nonsine wave can be 
replaced by an equivalent system of sine waves, i.e. it can be 
represented in the form of a group of waves, called a wave train, 
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or packet. The totality of frequency values for these sine waves 
is called the frequency spectrum (or simply spectrum) of the 
nonsine wave. In accordance with the nature of the oscillation 
excited by the wave, the wave’s frequency spectrum can be 
either discrete (28.4.6) or continuous (28.4.7). 

The law of propagation of anarbitrary disturbance (signal), which 
is a nonsine wave, in a linear medium is simple only provided 
that the medium is nondispersive (30.3.8). In this case the signal 
travels in the medium without changing its “shape” because all 
the sine waves making up the wave train have the same phase 
velocity, which is equal to the velocity of the signal. 

In a dispersive medium, the sine components of the wave train, 
corresponding to the nonsine wave, propagate at different 


S 

by 

nN 

aa A — 
g X 
Fig. 30.2 


velocities. Therefore, this wave train “spreads” as it propagates, 
thereby changing the “shape” of the signal. If, for example, a 
signal propagating along the OX axis in a dispersive medium 
had the “shape” shown by dash lines in Fig. 30.2 at the instant 
of time ¢,, then at the instant t, > t it could have the different 
“shape” shown by a solid line. 

30.4.3 The simplest wave train is the plane quasi-sine wave 
obtained by the superposition of two plane waves of the same 
amplitude and with frequencies and wavenumbers close in 
value propagating along the OX axis. Thus 


s= Ay sin (@t— kx) -++ Ap sin [(©-+ dw) t— (k+ dk) x] 
tdw—x dk 


= 2A) cos ( 3 


sin (@t—kz). 
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The dependence of s on z at some fixed instant of time is shown 
in Fig. 30.3. This wave differs from a sine wave in that its am- 


plitude 
A= 2A 9 cos (=) | 


is a slowly varying function of the coordinate z and time tł. 
Taken as the velocity of propagation of this nonsine wave is a 


Fig. 30.3 


velocity u of point M in which the amplitude A has some fixed 
value (for example, A = 0 or A = 2A,). Consequently, the law 
of motion of point M is: t do — x dk = const. From this 
aa 
= at dk” 
Velocity u is called the group velocity. It is equal to the velocity 
of energy transfer by a quasi-sine wave. The group velocity 
u = d/dk is suitable for describing energy transfer (signal 
transmission) by nonsine waves having a different frequency 
pema provided that the spectrum is not very wide and that 
the dispersion of waves in the medium at these frequencies is 
not too great. 
The relation between the group velocity (u = d@/dk) and the 
phase velocity (v = @/k) of waves is of the form: 

dv 
u=v— À de , 
where À is the wavelength (30.2.6). In a nondispersive medium 
dv/dà = 0 and the group and phase velocities coincide. 
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30.5 Interference of Waves. Standing Waves 


30.5.1 Two waves are said to be coherent if their phase difference 
is independent of time. Coherent waves correspond to coherent 
oscillations (or vibrations) 
(28.4.3). The sources of co- 
herent waves are called coher- 
ent sources. Sine waves of 
equal frequency are always 
coherent. Waves of different 
frequencies are coherent only 
for the length of time that 
the oscillations (or vibra- 
tions) they excite are co- 
herent (28.4.4). In super- 
posing incoherent sine wa- 
Fig. 30.4 ves, generated by point 
rb sources Sı and Sz (Fig. 30.4), 
the square of the amplitude 
A of the resultant nonharmonic vibration at arbitrary point 
M varies periodically with time ¢ according to the relation 
(see Sects. 28.4.2 and 30.2.8): 


A? = A? + A} + 2414, Cos [(@, — 04) ż 
== (kara Fm kırı) a (O. a &ı)l. 


Here A, and 4>, ©; and O, kı and k,, and a, and a, are the 
amplitudes at point M, angular velocities, wavenumbers and 
the initial phases of the two spherical waves being superposed. 
The period of variation of A? is equal to T = 2n/| @, — @; |. 
The average value of the square of the amplitude during the 
period is 


(A?) = A? + 42. 


In the superposition of incoherent waves, the squares of their 
amplitudes are added together. 

30.5.2 The interference of waves is the phenomenon, in super- 
posing waves, of their mutual strengthening, stable with time, 
at certain points in space and their weakening at others, depend- 
ing upon the relations between the phases of these waves. 
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Only coherent waves display interference, provided that they 
correspond to vibrations in the same or almost the same direc- 


tion. 
In superposing coherent spherical waves, generated by the point 


sources Sı and S, (Fig. 30.4), 
5S = Ay sin (ot — kry -+ Oy) = Ay sin OD, 


and 
So = A» sin (Ot = kro + Ola) — Ao sin D»; 


amplitude A and phase ® of the resultant harmonic vibrations 
at point M (where s = s, + s = Asin Ọ) are determined 
from the relations (28.4.3): 

A? = Aj + 43-4-2414, cos [k (ra— r1) — (a — &1)], 

A, sin D, + A, sin MD, 

A, cos D,-++ A, cos P, ° 

Since the difference of the initial phases a, — a, = const for 
the vibration of the coherent sources S, and Sj, the resulting 
interference of the two waves at various points M depends on 
the quantity A = r,-— r,, called the path difference of the waves. 
At interference maxima, the amplitude of the resultant vibration 
is A = A, + As, whereas at interference minima A = 
= | A; — A, |. Maxima are observed at the points M that 
comply with the condition 


kA — (&, — &) = +2mn, 


tan D = 


where m = 0, 1, 2,.. . is the order of the interference maximum. 
The condition for interference minima is of the form 


kA — (Gg — %) = + (2m — 1) a, 


where m = 1, 2, 3, . . . is the order of the interference minimum. 
Since the wavenumber k = 2n/A, where à is the wavelength in 
the given medium, the conditions for interference maxima and 
minima can be presented in the form: ~ 


A= mh} 2 à (maxima), 
_ N | hp— Oy ed 
A= (2m — 1) -5t E à (minima). 


416 ELASTIC WAVES [30 


Finally, when a, = o,,3the conditions take the form: JA = 
= +mÀ (maxima) and A = + (2m — 1) A/2 (minima). 

On the straight line ab, parallel to and at the distance Z from 
the line passing through the sources S; and S>, (Fig. 30.4), the 
central maximum of zero order is at point O, equidistant from 
sources S4 and Sə. If the distance between the sources l < L, 
then for point M on line ab, at the distance z < L from point O, 
the path difference of the waves is 


Maxima of the mth and (m + 1)th orders correspond to the 
values 


AL 1) AL 
Zm = = and 2m+1 ~ meriin, ’ 


so that the difference between adjacent maxima equals AL/1. 
30.5.3 When waves interfere, their energies cannot be simply 
added together. At interference maxima, the intensity of the 
resultant wave is greater than the sum of the intensities of the 
superposed waves, and at minima, it is less than their sum. 
Wave interference leads to a redistribution of vibrational energy 
between neighbouring regions of the medium. But on an average, 
for a large region of space, the energy of the resultant wave is 
equal to the sum of the energies of the interfering waves. This 
follows from the laws of conservation and conversion of energy. 
30.5.4 A special case of wave interference is standing waves. 
A. standing wave is one that is formed by the superposition of 
two travelling sine. waves that are propagating toward each 
other and have the same frequency and amplitude. In the case 
of transverse waves, they also have the same polarization 
(30.2.2). A standing wave is formed, for instance, in a stretched 
flexible string when one end is fixed and vibratory motion is 
imparted to the other end. 

In the superposition of two coherent travelling plane waves of 
the form 


sı = A sin (@t — kx) and s, = A sin (Ot + kr +a), 
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where @ is the phase difference at points z = 0, a plane standing 
wave is formed. It is described by the equation 


S= S1 F- S = 2A cos (zz + +) sin (o+) ; 

In contrast to the amplitude A of travelling waves, the ampli- 
tude Ast of a standing wave is a periodic function of the coor- 
dinate z: | 


cos (xz -} =| . 


30.5.5 Points of a standing wave at which Ast = 0 are called 
its nodes; points at which the amplitude Ast reaches its maxi- 
mum value (Ast = 2A) are called the antinodes. The positions 
of the nodes and antinodes are determined from the conditions: 


ka +- Oni -+ jy (nodes), 
2 2 
a ; 

kz -|- z =mi (antinodes), 


where m = 0, 1, 2, .... 

The distances between two adjacent nodes and between two 
adjacent antinodes are the same and equal to half the wave- 
length à of the travelling waves. This distance is called the 
length of the standing wave: Ast = A/2. The distance between 
a hii nodes and antinodes of a standing wave is equal to 
st/2. 

30.5.6 In a travelling wave the phase of the vibration depends 
upon the coordinate z of the point being considered. In a stand- 
ing wave all the points between two nodes vibrate with differ- 
ent amplitudes, but in the same phase (cophased) because the 
argument of the sine in the equation of a standing wave (30.5.4) 
is independent of the coordinate z. In passing through a node, 
the phase of the vibration changes abruptly by x, so that the 
sign of cos (kx 4- a/2) is reversed. 

Illustrated in Fig. 30.5 is the nature of the motion of various 
points on a stretched flexible string of length 7 when a trans- 
verse standing wave is established in it. The left end O of the 
string is set into harmonic motion and the right end N is fixed. 


27--0186 
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In this case, as the wave is reflected from the place where the 
string is fixed, its phase is changed by n, so that a node of the 
standing wave is formed at this point. The phase difference 
of the reflected and incident waves at point O (where z = 0) is 
a = — (2kl + x). The nodes of the standing wave are designated 


C=tby 


Fig. 30.5 


by small circles in Fig. ae Bs and the instant of time t, is selected 
so that sin (Ot) + a/2 


30.5.7 In a standing wave (30. 5.4), the velocity of vibratory 
motion of the particles of the medium is 


Os 1 Q 
v = Z =240 cos (+5) cos (o+ +} , 
and the relative deformation of the medium is 


ðs ; aœ i.e od 
= pr ~ 24k sin (z +5) sin§( ot + Z) 


=2Ak sin (r45) cos (we + $+) s 
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Thus, in contrast to a travelling wave (30.3.4), in a standing 
wave ¢ has a phase lead of n/2 over v,. Consequently, at the 
instants of time when v; reaches its amplitude value, e becomes 
zero and vice versa. Moreover, the amplitudes of v, and e depend 
upon the coordinate zx, but in different ways. Thus, the anti- 
nodes of the particle velocity and nodes of deformation of the 
medium are at the antinodes of the standing wave (30.5.5), 
whereas the antinodes of the deformation and the nodes of the 
velocity are at the nodes of the standing wave. 
In an elastic standing wave, the energy is periodically converted 
from potential energy, localized mainly near the deformation 
antinodes, into kinetic energy, localized mainly near the velocity 
antinodes, and back again. Therefore, energy periodically 
migrates from the nodes of a standing wave to its antinodes and 
back again. But in the nodes and antinodes themselves the 
energy flux density is identically equal to zero. The average 
value of the energy flux density over one period is equal to zero 
at any point on a standing wave, because the two travelling 
waves that form the standing wave transmit equal amounts of 
energy over a period in opposite directions. This is why they 
have been called standing waves. 
30.5.8 In the free vibrations of strings, rods and columns of gas, 
standing waves are set up in them. The frequencies of these 
waves comply with definite conditions, i.e. they can have only 
certain discrete values, which are called the natural frequencies 
of vibration of the corresponding vibratory system. Displace- 
ment nodes (deformation antinodes) are located at the rigidly 
fixed ends of strings or rods, and displacement antinodes (defor- 
mation nodes) are located at the free ends of the rods. In the 
vibrations of a cylindrical column of gas in a pipe, a pressure 
antinode is located at the closed end of the pipe and a pressure 
node at the open end. 
If Z is the length of the string, rod or gas column, v is the phase 
velocity of the wave and A is the wavelength, then for strings 
and rods fixed at both ends and for gas columns in pipes that 
are open or closed at both ends, a whole number of lengths 
Àst = A/2 of the standing wave fits into the length l, i.e. 
l = mAs, = md/2, where m= 1, 2, 3,... 
The natural frequency of vibrations of such systems is 

mv 


27% 
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For rods fixed at one end and free at the other, or for pipes 
closed at one end and open at the other, 


f= (2m—1) Ast -=(2m—1) 4 


and the natural frequency of vibration is 


_ (2m—1)v 
a 4l ° 


30.6 Doppler Effect in Acoustics 


30.6.1 The Doppler effect refers to the change in the frequency 
of waves, observed or recorded by a receiver, or detector, which 


es 


2 
(d) ee EE ee 


V; 
(e) E 


Vs 


Fig. 30.6 


is due to motion. of the 
source of the waves and the 
detector (or observer). For 
example, as a rapid train 
approaches a stationary ob- 
server, the pitch of the loco- 
motive whistle is higher, 
and as the train passes the 
observer and travels away 
it is lower, than the pitch 
of the same whistle when 
the train is standing still. 

Assume that the detector 
D of acoustic waves in a gas- 
eous (or liquid) medium 
is fixed with respect to the 
medium, and that the source 
S is moving away at the 
velocity v, along a straight 
line joining them (Fig. 30.62). 
During the period Tọ of 
its vibration, the source 
travels in the medium a 
distance of ViTo = V1/Vo; 
where v, is the source’s fre- 
quency of vibration. There- 


fore, upon motion of the source the wavelength A in the medium 
differs from its value A, when the source is stationary: 


A = ho + Pq = (v + vi) To = (V+ vi), 
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where v is the phase velocity of the wave in the medium. The 
frequency of the wave recorded by the detector is 


v Vo 
Ņy = = 


À 1+-v,/v ° 


If the velocity vector v, of the source makes the arbitrary angle 
0), with the radius vector R, connecting the stationary detector 
to the source (Fig. 30.66), then 


eee ees 
1+v cos 0,/v ° 


30.6.2 When the source is stationary and the detector approaches 
it at the velocity v, along a straight line joining them (Fig. 30.6c), 
the wavelength in the medium is A = A, = v/v. But the veloc- 
ity of propagation of the wave with respect to the detector is. 
v + va Hence, the wave frequency recorded by the detector is 


y= 


n ev (1+2) : 


In the case when the velocity vector v makes the arbitrary 
angle 6, with radius vector R, connecting the travelling detector 
to the stationary source (Fig. 30.6d), 


v=vo (142 cos 0, | { 


30.6.3 In the most general case, when both the detector and 
source of acoustic waves are travelling with respect to the 
medium at arbitrary velocities (Fig. 30.62), 


14+ cos 0, 
MV MOC = S 
14 cos 0, 


This formula can also be presented in the form 


2 
v=v {1—2 [17 cos0, + (Z cos 6; } = sel : 


v 
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where V = v;i — Və is the velocity of the wave source with 
respect to the detector, and © is the angle between vectors V 
and R. The quantity V cos 0, equal to the projection of V 
on the direction of R, is called the ray velocity of the source. 
When v < v, | 


V cos 8 
; ; 


VY Vo (1— 


CHAPTER 31 ELECTROMAGNETIC WAVES 


31.1 Properties of Electromagnetic Waves 


31.1.4 Electromagnetic waves are disturbances of an electromag- 
netic field (i.e. a variable electromagnetic field) that prop- 
agate through space. The statement of the existence of electro- 
magnetic waves follows directly from Maxwell’s equations 
(27.4.3). For an electromagnetic field, distant from the free 
electric charges (18.3.1) and macroscopic currents (26.4.1) that 
it was set up by, these equations are of the form* 


OB aD 
curl E = a curl rr j 
div D = 0, div B=0. 


31.1.2 If the medium is a uniform and isotropic dielectric 
(30.1.6), having no ferroelectric (18.4.1) or ferromagnetic 
(26.5.1) properties, then D = ¢,e,E and B = u,;poH, where 
e, and p, are constant scalar quantities independent both of 
the coordinates and of time. Here Maxwell’s equations (31.1.1) 
can be rewritten in the form 


ôH dE f 
curl E = — urto ` curl H = 8r80 ETIE div E=0 
and div H=0, 


* All equations in Chap. 34 are written in SI units. 
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or in projections on the axes of Cartesian coordinates: 


oF, Ey o ôH x ôH, H, _ oe, Ex 
dy dz. Tho ap) Gy 7 rto’ 
OE, ðE, Oy 0H, 0H, _, , OEy 
ôz ôr M-a °> z OBE? 
OEy dE _ nae OHy _ ôHs _, , Ez 
Ox oy reat? OO oy Or 
OE, , Ey , dE, OH x v OH, _ 
dx t Oy tos” De tay o TO 


31.1.3 It follows from Maxwell’s equations (31.1.2) that the 
strength vectors E and H of a variable electromagnetic field 
and their projections onto the axes of Cartesian coordinates 
comply in a uniform, isotropic, nonconducting medium, with 
the wave equation (30.2.9): 


AE — Er£ouUrHo 1E =0, AH — &r2oUrLo <i - 0, 

AE x — Er8oMrHo A 0, AEy—®r&olrls Sl 

AE, — Er£oUrHo oe. =0, 

AH xy — Er8oHruo saat =0, AHy —e&r&oUrHo = =0, 
oH, 


AH, —®r€oUrLo =. 


ot? 
Thus a variable electromagnetic field really propagates in space 
in the form of waves with the phase velocity 


c 
eS where c= 


4 
V Erur ' 7 V Epo 


In vacuum £, = p, = 1. Hence, c is the velocity of electro- 
magnetic waves in vacuum (in free space). 


= 3 X 108 m/s. 
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31.1.4 Electromagnetic waves are transverse waves: vectors E 
and H of the field lie in a plane perpendicular to the direction 
of wave propagation, i.e. to the wave velocity vector v at the 
point being considered in the field. This can be most readily 
demonstrated by the example of a plane wave, propagating 
along the positive direction of axis OX (80.2.4): 


E=f(t—=}) and H=@ (+=). 


Vector E and H and their projections on the coordinate axes are 
independent of y and z: 


GE, OF, Oy y _ ðE, _ ðE, 


dy a Oy Cy 
and 

0H, OH,  OHy Oy al, ôH, D 
dy z — ôy əz  ðy OC 


It follows from Maxwell’s equations (31.1.2) that for the field 
of a plane wave, 

OEy OE xy _ GU OH e -. 

gp gg ge 

i.e. Ey and H, are independent of both coordinates and time. 
Hence, for a variable field of a plane 
wave E = H, = 0, and the vectors 
E and H are perpendicular to the 
direction of wave propagation: 


E=£,j+£,k and H=4H,j+4H,k, 


where j and k are unit vectors of the 
coordinate axes. 
31.1.5 Vectors E and H of the field 
Fig. 31.4 of an electromagnetic wave are mu- 

tually perpendicular, so that v, E 
and II form a right-hand set of three vectors (Fig. 31.1). As a 
matter of fact, for the field of a plane wave (31.1.4), 

x x 

Ey=fy Ga =| » L,=fe (;-=) ’ 


p 


[5 
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£ 


v 
it follows from Maxwell’s equations (31.1.2) that 


— dEy dH 
V Er£0 “ao = V urko 7 — and 


— dE — ail, 
V Er£0 iE = — y urto EJ , 


where — = t — z/v. Therefore, for a variable field of a plane 
wave 


, Cpe ; ErEp 
H =e a a) R Bey and (EH)=0. 
á Urto ” : Uro 4 E 


The mutually perpendicular vectors E and H oscillate in phase: 
they become zero simultaneously and reach their maximum 
values simultaneously. The relation between their magnitudes is 


V Er€o E = V urto H, 


which is valid for any travelling electromagnetic wave (30.2.1) 
regardless of the shape of its wavefronts (30.2.3). 

31.1.6 A monochromatic wave is an electromagnetic wave of a 
single definite frequence v, i.e. an electromagnetic sine wave. 
At each point of the electromagnetic field of a monochromatic 
wave, the projections of vectors E and H on the coordinate axes 
of an inertial frame of reference have harmonic oscillations 
(28.1.3) of the same frequency, equal to the wave frequency v. 
For example, in the case of a plane monochromatic wave, prop- 
agating along the positive direction of axis OX, 


Ey = A, sin (ot— kz), Hey Erëo E; 
Uro 
A ErEg 
E,=A,sin(@t—kz+ 9), A,= pos Ey, 
Hrlto 


where © = 27v is the angular frequency of the wave, k is the 
wavenumber (30.2.6), A, and A, are the amplitudes of Ey and 
E,, and ọ is the phase difference between the oscillations of 


E, and E, 
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31.1.7 For an arbitrary value of ọ (31.1.6), a plane monochromat- 
ic wave is elliptically polarized, i.e. at each point of the wave’s 
field, vectors E and H, while remaining mutually perpendicular, 
vary with time so that their heads describe ellipses lying in a 
plane perpendicular to the direction of wave propagation: 


E? EL 2EyE, 


Aj Aj A,A, 


cos P= sin? @, 


and 
H2? H? 2H „H 

y Ni z y- z = r&o 9 
ZA -AF A, A, cos @ r sin? @ 


In particular, when A, = A, and ọ = + (2m + 1) n/2, where 
m = 0, 1, 2, ..., the ellipses become circles: 


_ Ero 


Such a wave is said to be circularly polarized. 
When ọ = +mx, where m = 0, 1, 2,..., the ellipses degener- 
ate into straight lines: 


Hy H, 
F Aa 


Ey E, 
A, Ay = 6 and A, 


Such a wave is said to be linearly, or plane, polarized. Shown 
in Fig. 34.2 are the values of E and H of the field of a plane, 
linearly polarized, monochromatic wave at various points 
along the ray (OX axis) at the same instant of time. Axes OY 
and OZ are in the directions of oscillation of vectors E and H, 
respectively, so that E, = H, = 0. According to accepted 
terminology, the plane passing through the electric vector E 
and the ray is called the plane of polarization of the linearly 
polarized wave. Previously, this plano was called the plane 
of wave oscillation, and the plane of polarization was called the 
one passing through the magnetic vector H and the ray (this 
obsolete terminology may still be found in the literature). 
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31.1.8 An arbitrary plane monochromatic wave can be repre- 
sented as a combination of two plane monochromatic waves of 
the same frequency as the first wave and simultaneously prop- 
agating in the same direction as the first wave, but linearly 


Plane of polarization 
\ 


Fig. 31.2 


polarized in mutually perpendicular planes. For example, a 
plane monochromatic wave, propagating along the OX axis 
(31.1.6) can be dealt with as the result of superposition of a 
y-wave (E; = E,) and a z-wave (E, = E,). 


31.2 Energy of Electromagnetic Waves 


31.2.1 The volume energy density of an electromagnetic field 
in an isotropic medium is 


ie Set 


where e, and up are the relative permittivity (dielectric con- 
stant) and relative permeability of the medium. From the rela- 
tions between the magnitudes of vectors E and H of the field 
set up by an electromagnetic wave (31.1.5) it follows that the 
volume energy density of electromagnetic waves is equal to 


Ea = ¢,89H? = prm H2 = V ere gttrlty pal itr EH, 
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where c is the velocity of electromagnetic waves in free spaee 
(31.1.3). 

31.2.2 In the case of a plane, linearly polarized, monochromatic 
wave, propagating along the positive direction of axis OX, the 
field strength Æ = A sin (@t — kz). Consequently, the volume 
energy density of this wave is 


Ed = e,8)A? sin? (@t — kz). 


The value of Ed at each point of this field varies periodically 


with the frequency ©/n in a range from 0 to ES ge = €,£)A?. 


The average value of Ed over one period is proportional to the 
square of the amplitude of the field strength: 


T/O i 
(A) 
(Ea) = | Eù dt= -5 68,42. 
0 


If the plane monochromatic wave has an arbitrary (elliptical) 
polarization (31.1.7), then, according to Sect. 31.1.8, 


Ed = g£ [A? sin? (ot — kr) + 4A? sin? (ot — kz + @)] 
and 


(B4) = ene (Af + A}. 


31.2.3 The energy flux density vector (30.3.5) of an electro- 
magnetic wave is called the Umov-Poynting vector P (more 
frequently, it is called the Poynting vector). The velocity of 
energy transfer by a monochromatic wave is equal to the phase 


velocity v = c/V €,p, of this wave (30.3.5). Therefore, the 
Umov-Poynting vector for such a wave is 
P = Edy = [EH]. 


In the case of a plane, line ily polarized, monochromatic wave 
(31.2.2) the Umov-Poynting vector is in the direction of wave 
propagation and is numerically equal to 


P= jy A? sin? (ot — kz). 
HrHo 
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When a plane monochromatic wave is elliptically polarized 
(31.4.7), 


p= V2 [42 sin? (ot — kz) + 42 sin? (wt—kx-+ ọ)]. 
“riro 

31.2.4 The intensity I of an electromagnetic wave is a physical 
quantity numerically equal to the energy transferred by the 
wave in unit time through unit area of a surface located per- 
pendicular to the direction of wave propagation. The intensity 
of an electromagnetic wave is equal to the absolute value of the 
average Umov-Poynting vector during a period of its complete 
oscillation: 


1 = | (P). 


The intensity of a travelling monochromatic wave is 
I = (E4) v, 


where v is the phase velocity of the wave, and (£4) is the average 
value of the volume energy density of the field of the wave. 
The intensity of a plane, linearly polarized, travelling mono- 
chromatic wave (31.2.2) is proportional to the square of the 
ee A of oscillation of the wave’s field strength vector E. 
Thus 


pa. y Erto 43. 
2 Hruo 
When a plane monochromatic wave is elliptically polarized 


(31.1.7), its intensity is equal to the sum of the intensities of 
the y- and z-waves that form the wave being considered (31.1.8): 


1 Ere 

I= I-I 27 ro. (A? +- 43). 

y F Z 2 Urko ( iF 3) 
Note. The intensity of light, i.e. the electromagnetic waves dealt 
with in optics, is commonly understood to be the square of the 
amplitude of oscillation of the light wave’s field strength vec- 
tor E. 
31.2.5 The intensity J of a spherical, linearly polarized, monc- 
chromatic wave is related to the amplitude A of oscillation 
of vector E in the same way as for a plane wave (31.2.4). But 
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the amplitude and intensity of a spherical wave decrease as the 
distance r increases from the wave centre (30.3.6): 


A=aj/r and I = i/r. 


31.2.6 Electromagnetic waves exert pressure on the obstacles 
in their path that absorb and reflect the waves. The pressure of 
electromagnetic waves is explained by the fact that due to the 
effect of the waves’ electric field charged particles of matter are 
put into ordered motion and are subject to the action of Lorentz 
forces exerted by the magnetic field (24.4.1). 

According to Maxwell’s theory the pressure of electromagnetic 
waves is equal to 


== (1+ R) cos? i= (Eù) (1+ R) cos? i, 


where J is the intensity of the wave incident on the obstacle, 
visits velocity, (E42) is the average value of the volume energy 
density of the wave, i is the angle of incidence (31.5.2), and 
R is the reflection coefficient (31.5.6). 


31.3 Electromagnetic Radiation 


31.3.1 The process of exciting electromagnetic waves and emit- 
ting them into surrounding space by some system is called the 
radiation of these waves. The system itself is called the radiating 
system. The field set up by these electromagnetic waves is called 
the radiation field. According to the concepts of classical electro- 
dynamics, electromagnetic waves are excited by electric charges 
travelling with acceleration (in particular, an electric circuit 
in which the current varies). Vavilov-Cherenkov radiation can 
also occur in matter . (34.6.1). 
The simplest radiating system is the electric dipole (15.2.3) whose 
dipole moment pe varies with time. Such an “oscillating” dipole 
is called an oscillator, or elementary vibrator. Oscillators are 
extensively used in physics for simulating and calculating the 
radiation fields of real systems. If the radiating system is elec- 
trically neutral and its size is small compared to the length A 
of the waves it radiates, then in the wave zone of the system, 
i.e. at points whose distance from the system equals r >> A, the 
radiation field is almost the same as that of an oscillator having 
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the same electric dipole moment as the whole radiating system. 
31.3.2 A linear harmonic oscillator is an electric dipole whose 
moment pe varies according to the harmonic law: 


Pe = Po Sin ®t, where p = const. 


At point M of the wave zone of a linear harmonic oscillator, 
located in vacuum, the vectors E and H of the radiation field 
at the instant of time ¢ are equal to: 


E (¢) = — be [[por] r] sin (o@t— kr), 


| w? 
H (= = ATTA [por] sın (wt—kr), 

where r is the radius vector from point O, where the dipole is 
located, to point M (Fig. 31.3). Vectors E and H are mutually 


perpendicular and lie in a plane perpendicular to the radius 


Fig. 34.3 Fig. 31.4 


vector r. Hence, E, H and r form a right-hand set of three vectors. 
Vector E is tangent to the meridian, and vector H is tangent to 
the latitudinal circle passing through point M of a sphere of 
radius r having its centre at point O. 

31.3.3 The Umov-Poynting vector at point M of the wave zone 
of a linear harmonic oscillator is 


42 cin2 
P = B Pes r sin? (@t—kr), 


where @ is the angle between vectors pọ and r. 
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The intensity of the electromagnetic wave at point M is 
Up w4p% sin? 0 
32n?er? 


The dependence of J on 0 for a fixed value of r, shown in polar 
coordinates (Fig. 31.4), is called the polar directional diagram 
of oscillator (dipole) radiation. The dipole radiates most inten- 
sively in directions that make an angle 0 = x/2 with its axis, 
i.e. in a plane passing through the middle of the dipole and 
perpendicular to its axis. Along its axis (0 = 0 and 0 = zx) 
the dipole does not radiate at all. 

The average power of radiation of a linear harmonic oscillator is 


[= 


T 
OE oe . — Potr 
(PY) = 27r \ T sin 0 d0 = mion 
0 


31.3.4 The instantaneous power of radiation of charge q, trav- 
elling at the acceleration a, is 


pr =o 


If the charge executes harmonic oscillation of the amplitude J, 
and angular frequency œ, the average power of its radiation is 


_ Hogoi? 
Ee 4127c 


According to classical theory the radiation of light by atoms is 
due to the vibration of electrons in them. Owing to the con- 
sumption of energy to produce radiation this vibration is grad- 
ually damped, i.e. its amplitude varies according to the law 
(29.1.3): Ip = loo exp (—Bt), where 1), is the initial amplitude 
and Ê is the damping factor. The energy of vibration of an 
electron (28.2.2) is Æ = mew?l2/2, where me is the mass of the 
electron. Since —dE = (Pr) di = 2BE dt and q = —e, the 
relaxation time (29.1.3) of this vibration, called the average 
de-excitation time of the atom, is 
1 120CM e 


T = oS 
B [ge?e? 
For visible light © ~ 4 x 104% s-! and t ~ 2 x 107-8 s. 
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31.4 Electromagnetic Spectrum 


31.4.1 Depending upon the frequency v (or wavelength à = c/v 
in vacuum, where c is the velocity of electromagnetic waves in 
frec space), as well as on the way in which they are radiated 
and recorded (or detected), distinction is made between several 
kinds of electromagnetic waves: radio waves, optical radiation, 
\-rays and gamma rays. 

ltadio waves are electromagnetic waves of a length A in free space 
that exceeds 5 X 10-> m (correspondingly, v < 6 x 102 Hz). 
In connection with special features of their propagation and 
veneration, the full range of radio waves is commonly divided 
into nine bands (Table 31.1). 


TABLE 31.1 

Band name Wavelength, m | Frequency, Hz 
Very-long (myria- | Over 10! Less than 3 x 10! 
metric) waves i 
Long (kilometric) | 10t to 103 3 x 10! to 3x 105 
waves 
Medium  (hecto- | 103 to 10? 3x 105 to 3x 108 
metric) waves 
Short (decametric) | 102 to 10 3X 10° to 3 x 10’ 
waves 
Metric waves 10 to 1 3x 107 to 3x 108 
Decimetric waves | 4 to 0.1 3x 108 to 3x 10° 
Centimetric waves | 0.1 to 0.04 3x 10° to 3 x 101° 
Millimetric waves | 10-2 to 10-3 3x 101° to 3x 10H 
Submillimetric 10-2 to 5 x 10-5] 3 x 1014 to 6 x 1012 
waves 


31.4.2 Radio frequencies are divided into twelve bands accord- 
ing to the international regulations for radio communications.: 
These, bands and their corresponding wavelength in free space 
are listed in Table 31.2. 


28—0186 


(3 


ELECTROMAGNETIC WAVES 


434 


— 8 ee 


“ato UIATS JSH ƏY} JO Spueq IIIQ} 4sIIJ 94} “al ‘ZH £ 0} ZH € WoO] səpouənb 


SOABM Aouenb 

Wu p'O OF p| MIOMA |Z; pueg] 2ZHLE OE O|IHH -ə1} q3ıq 1əd4H Izy pueg 
SOARM Aouenhbedy 

uu } 0} OF OTIJƏWHTTTN |77 pueg| ZH9 00£ 04 0g | AHA 4314 SETA p; pueg 

guon 

Wd J OF OF |S3AVM 91179017030 |07 pueg) ZH9 0g 0}g| JHS -e1f YStq sednsg |0; pueg 
Aouenb 

W yO OF J} SeAeA JIWA | 6 pueg| ZHY £ 04} g'0| JHN -37 YsIy vayn | 6 pueg 
Aoguenb 


wu p 0} OF SOABM OJIN | 8 PUP) ZHW 00E 91 0E | AHA -ə17 YSIy 13A | 8 pueg 
W OF 0} OOF | SeAVM orızoweyoq | 4 pueg) ZHW 0g0rg| JH  <Aouenbosy q3ıH | L pueg 
W p'O 0} F |S9Aem 91I179W0700H | 9 pueg) ZHN £ 0}g0| AW AoUenbeIy umrpow| 9 pueg 
WY p OF OF | S9ALm OTIYOMMO[TY | G pueg| ZHY 00e 030g) AT Aouanbely mo | c pueg 


SOACM Aouenb 
uy OF 0} OOF oltjomelIAW | p pueg) 2H 0g 01 g| JTA -O1y MOT LI0A y pueg 
SIALM 
w OOF 0} OOOT | Ot1əworr0790H | £ pueg| ZHYE0E'O| JA Aduenbery oo10A | ¢ pueg 
Aouenb 
WN p 0} OF | SeARM 91I1JIWLeZIN | Z pueg) ZH 00g 91 0G! ATS - -917 moj dedng} z purg 
SIALVM » Aouendb 


WN OF 0} OOF | opyoweZowegoq | 7 pueg) ZH 0g 0} g| ATA -ə1} Mol Aowa | y pueg 
aaa eee 


uolyeu uolyeu 
uoT}eustsep [euolydo| -Srsap uoTJeUsIsap peuordo -3]sop 
orseg əojseg 
syrup pueg syu pueg 
uolyeu 
-SIsop pueq dAeM-OIpey uoTjBUsIsep pueq AQuanbadj-orpey 


eee 
o°ye AIAVL 


31.5] REFLECTION AND REFRACTION 435 


31.4.3 Optical radiation, or light, refers to electromagnetic waves 
electromagnetic radiation) of a wavelength in vacuum ranging 
Irom 10 nm to 1 mm (by convention). Optical radiation includes 
infrared, visible and ultraviolet radiation.* 

Infrared (IR) radiation is the electromagnetic radiation emitted 
by heated bodies. Its wavelength in vacuum is within the range 
from 1 mm to 770 nm (1 nm = 10- m). 

Visible radiation, or visible light, is electromagnetic radiation 
of a wavelength in vacuum from 770 to 380 nm. It is capable of 
directly producing a visible sensation in the human eye. 
Ultraviolet (UV) radiation is electromagnetic radiation of a 
wavelength in vacuum from 380 to 10 nm. 

31.4.4 X-radiation, or X-rays (also called Roentgen rays), is the 
electromagnetic radiation produced in the interaction of charged 
particles and photons (37.1.4) with atoms of matter. Its wave- 
length in vacuum ranges within the wide conditional limits 
from 10 or 100 nm to 0.01 or 1 pm (4 pm = 10-% m). 

Gamma radiation, or gamma rays, is electromagnetic radiation 
of a wavelength in vacuum less than 0.1 nm. It is emitted by 
excited atomic nuclei upon radioactive transformations and 
nuclear reactions. It is also formed in particle decay, in 
particle-antiparticle annihilation (43.5.5) and other processes. 


31.5 Reflection and Refraction of Electromagnetic 
Waves at the ‘Interface Between Two Dielectric Media 


31.5.1 The refractive index (absolute refractive index) of a medium 
is the quantity n, which is equal to the ratio of the velocity c 
of the electromagnetic waves in vacuum to their phase velocity 
v in the medium: n = c/v. From Sect. 31.1.3 it follows that 


n = V @,p,, where e, and yp, are the relative dielectric per- 
mittivity and relative magnetic permeability of the medium. 


* According to the recommendation of the Committee for 
Scientific and Engineering Terminology of the USSR Academy of 
Sciences [Physical Optics (Terminology), Collection of Recom- 
mended Terms, Issue 74, Nauka Publishers, 1968, Moscow] 
optical radiation also includes X-rays, the range of wavelengths 
of ee radiation being established from 0.1 A (10-4 m) 
to cm. 


28% 
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For Ven having no ferromagnetic properties, u, ~ 1 and 
WAV Ex. 

The relative refractive index of two media (of the second medium 
with respect to the first) is the quantity n, equal to the ratio 
of the refractive indices of these media: nj, = n,/n,. For non- 
ferromagnetic media nə = Veale 

31.5.2 When an electromagnetic wave falls on the interface 
between two media, it is partly reflected from the interface and 
partly refracted as it enters 
the second medium. In 
Fig. 31.5 the line AB is 
the plane interface between 
the media. Rays 7, 17’ and 
2 represent the directions 
of propagation of the inci- 
dent, reflected and refracted 
plane waves. They are called 
the incident, reflected and 
refracted rays, respectively, 
and the angles between them 
and the angles they make 
with the perpendicular ab to 
the interface at the point of 
incidence O are: iis the angle of incidence, i’ is the angle of 
reflection and r is the refractive angle. — E 
The plane of incidence is the plane passing through the incident 
ray and perpendicular to the interface at the point of incidence. 
31.5.3 The To of reflection and refraction of electromagnetic 
waves at the interface between two dielectric media can be 
obtained on the basis of the boundary conditions for an electro- 
magnetic field (27.4.4). In the first medium the field of the 
reflected wave (Erf! and Hrt!) is superposed on the field of the 
incident wave (E° and H°). In the second medium there is only 
the field (Erfr and H!r) of the refracted wave (propagating in 
this medium). Ilence, the boundary conditions are of the form 
(assuming that py. = Hm = 1): 


Fig. 34.5 


f 0 rfl rfr 
EL J-E = EIT, er (EXER) = trek), 


HLJ- HU = HI, H HAP = e. 
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llere E4, Hq, En and H, are the projections of vectors E and H 
on the tangential plane and the normal to the interface between 
lhe media. It follows from these relations that in the incidence 
of a plane monochromatic wave on a smooth plane interface 
hetween media, the following laws are complied with (regardless 
of the kind of polarization of this wave): 

(a) The reflected and refracted waves are also monochromatic 
waves of the same frequency as the incident one. 

(b) The law of reflection states that the reflected ray lies in the 
plane of incidence, and the angle of reflection is equal to the 
angle of incidence (i = i). 

(c) The law of refraction states that the refracted ray lies in the 
plane of incidence and that the refractive angle is related to the 
angle of incidence by Snell's law: 


sind _ Ne 
sinar m 
31.5.4 Making use of the boundary conditions (31.5.3) the rela- 


lions can be found between the phases, amplitudes and inten- 
sities of incident, reflected and refracted monochromatic waves. 


== ibe 


Fig. 31.6 Fig. 31.7 


Tt is sufficient for this purpose, according to Sect. 31.1.8, to 
know the above-indicated relations for linearly polarized waves 
of two types: p-waves, whose vector E = E, lies in the plane of 
incidence and whose vector H = H, is perpendicular to this 
plane (Fig. 34.6), and s-waves, whose vector E = E, is perpen- 
dicular to the plane of incidence and whose vector H Hy, lies 
in this plane (Fig. 31.7). 
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The relations between the amplitudes of oscillation of vector E 
in the incident (A°), reflected (Arf!) and refracted (Arfr) waves 


in the cases of p- and s-waves are expressed by the Fresnel for- 
mulas: 


Arti __ 40 tan (i—r) rfr 40 2cosisinr 

P `P tan(i-+r)’ `? ~~ “Psin(i+-r)cos(i—r) ’ 
ataa _Sin (i—r) , Atri 40 2 cos isin r 

s s sin(i+r) s s sin(i+r) 


In particular, with normal incidence of waves on the interface 
between media (i = r = 0), 


rl 40 nəı—1 rfr__ 40 2 

Ap 7 Ap nati’ Ap = 4p Noy +1 
rfl__ ss Moy — 1 rfr__ 40 2 

fs As yet? 48 At ET’ 


In the Fresnel formulas AS, and A? are positive quantities, 


whereas Av? and At are also positive for any possible angles 
of incidence and refractive angles. This indicates that the re- 
fracted and incident waves are in phase. Quantities A"! and Au 


can be either negative or positive. In the first case, illustrated 
in Figs. 31.6 and 31.7, the phase of oscillation of vector E is 
changed by x upon reflection (the phase of oscillation of vector H 
does not change). In the second case, the phase of oscillation of 
vector E remains unchanged (and, accordingly, the phase of 
oscillation of vector H changes by n). 

31.5.5 The values of the phase difference in the oscillation of 
vector E upon the reflection of p- and s-type electromagnetic 
waves are listed in Table 31.3, in accordance with the condi- 
tions (i.e. depending on the angle of incidence i and the relative 
refractive index ną of the media). 

The angle of incidence ip, at which the reflected and refracted 
rays are perpendicular to each other is called Brewster's angle. 
When i = ig, then i + r= x/2 and it follows from the law 
of wave refraction (31.5.3) that®tan ip, = na. It is evident 
from the Fresnel formulas (31.5.4) that at i = ipp, the amplitude 


At} = 0, i.e. the p-wave is not reflected from the interface 
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TABLE 31.3 
Conditions itr<en/2 itr>n/2 
(i < igr) (i > igr) 
net > 1 nə <i noi > 1 naj <i 
Hype df wave (i >r) (i<r) (i >r) (i <r) 
p-wave 1 0 0 - T 
s-wave T 0 Tt 0 


between the media, but completely passes from medium 4 
into medium 2. 
31.5.6 The coefficient of reflection (or B viae coefficient) R of an 
electromagnetic wave from the interface of two media is the 
ratio of the intensities (31.2.4) of the reflected and incident 
waves: 

[rfl Arfi \2 

T0 =( Ae } i 


The reflection coefficients of the p- and s-waves are equal to 


EEA tan? (i— r) __ sin? (i—r) 
P™ tan?(itr) ’ ~*~ sin? (i+r) ° 


In particular, with norma] incidence of waves on the interface 
between two media (i = r= 0 


When the incident wave is arbitrarily polarized, the reflection 
coefficient is 


mrs Rp4 RI! 
BFP DFE 
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where J® and J are the intensities of the p- and s-components of 
the incident wave whose intensity is 7° = 19 + I. 


31.5.7 The transmission factor, or transmittance, T is the ratio 
of the intensity of the transmitted (refracted) wave to that of 
the incident wave. Thus 


r rtr =y Ere Arfr \2 Arfr \2 
-oV B (Atom (AY, 


The transmission factors for the p- and s-waves are 


4 cos? i sin i sin r 4 cos? i sin į sinr 
T p= TT anan A E F T ES Gama 
sin? (i+ r) 


~ sin? (i+ r) cos? (i—r) ’ 


In particular, upon normal incidence of waves on the interfaca 
between media (i = r= 0), 


Ano, 


p (na -H14)2 ° 


31.5.8 When no, = (na'n) < 1, the refractive angle is greater 
than the angle of incidence: sin r = sin i/n., and r >i. The 
angle of incidence at which the refractive angle becomes equal to 
m/2 is called the critical angle i,,. Angle i,, = arc sin na. When 
i> ier, the intensities of the reflected and incident waves are 
the same, i.e. the wave is completely reflected from the interface 
between the media (R = 1). This phenomenon is called total 
internal reflection. 


Tp=T 


31.6 Doppler Effect 


31.6.1 Upon motion of the source and detector of electromagnetic 
waves toward each other the Doppler effect is observed (30.6.1). 
The laws governing this phenomenon for electromagnetic waves 
can be established on the basis of the special theory of relativity. 
Assume the detector D to be stationary with respect to an inertial 
frame of reference K, and that the source S travels with respect 
to K along the positive direction of axis OX at the velocity V 


31.6] DOPPLER EFFECT 441 


(Fig. 34.8). Source S is stationary in frame of reference K’ 
and is located at its origin of coordinates. The axes of coordinate 
systems K and K’ are parallel pairwise (axis O'X’ coincides 
with axis OX). Illustrated in Fig. 34.8 is the position of source S 
at the instant of time ¢ = ť = 0. According to Einstein’s rela- 


Fig. 31.8 


tivity principle (5.1.2), the equation of a spherical monochro- 
matic wave (30.2.8), sent by the source at this instant of time 
in the direction of the detector D, is of identical form in 
frames of reference K and K’: 


~~ a 5 
g= = pi(at+kx cos 0+ky sin 8+6) i 
and 

r 
[=o ei(o’t’+h’x’ cos O’-+h’y’ sin 0740’) (b) 


Here ® = @, and © are the angular frequencies of vibration 
of the source and detector, k = w/c and k’ = o’/e are the 
wavenumbers (it is assumed that the wave propagates in va- 
cuum), and 9 and 6’ are the angles between the directions of 
observation and of velocity V (OX axis), measured in the frames 
of reference of the detector (K) and for the source (XK’), 
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Expression (b) should be obtained from , expression (a) by re- 
placing the variables z, y and t by z’, y’ and ?’ in accordance 
with Lorentz’s transformations (5.3. 2): 


w’t’+k’z' cos 0’ +k’y’ sin 0’ +8’ 
t Vr [c? z’+Vt’ : 
= 0 | -——_:__:™:!: k | — ] cos 9 ky’ sin® 6. 
( ViVi )+ (a) ea 

Consequently, 

ETE a E LE 

= VIVIA (14 A cos 6 : 

k V 
k’ cos 0’ = — (co 0 -=> 
imag (oe ote) 

k' sin 0’ = ksin® and 6’ = ô. 


Hence, the relations describing the Doppler effect for electro- 
magnetic waves in vacuum are of the form: 


a ts Aaa and yei ZO 
1+- cos 8 14+ + cos 6 


31.6.2 At low velocities of the wave source with respect to the 
detector (V < c), the relativistic formula for the Doppler effect 
(31.6.1) coincides with the classical formula (30.6.3): 


væv (1——cos 0), 


When the source travels with respect to the detector along the 
straight linc that joins them (0 = 0 or x), the longitudinal 
Doppler effect is observed. If the source and detector approach 
each other (0 = x) 


LESTA 
v=» 4 - Eye > Vos 


whereas if they travel away from each other (0 = 0) 


1—V/[c 
1+V/e 
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31.6.3 Following from relativistic theory of the Doppler effect 
(31.6.4) is the existence of the transverse Doppler effect. It is 
observed when @ = 7/2 and 0 = 37/2, i.e. when the source 
travels perpendicular to the line of observation. Thus 


v= vo V 1—(V/c)?. 


The transverse Doppler effect is substantially weaker than the 
longitudinal effect (due to the smallness of V/c). This effect is 
due to the different passage of time in the frames of reference in 
which the detector and source of waves are fixed. Therefore, 
the experimental detection of the transverse Doppler effect 
was one of the most vital experimental proofs of the special theo- 
ry of relativity. 

31.6.4%The Doppler effect, associated with the thermal motion of 
the gas? atoms! that radiate light, leads to Doppler broadening 
of spectral lines.:The range of frequencies of light recorded by 
the detector extends from vy — Avp/2 to vo + Avp/2, where 
Vv) is the frequency of monochromatic radiation of a stationary 
atom. The amount of Doppler broadening Avp & V T/m, 
where T is the absolute temperature and m is the mass of the 
atom. 


PART FIVE OPTICS 


CHAPTER 32 INTERFERENCE OF LIGHT 


32.1 Monochromaticify and Time Coherence of Light 


32.1.1 Optics is the branch of physics concerned with the study 
of the nature of light (31.4.3), the laws of its production, propa- 
gation and interaction with matter. Wave optics deals with opti- 
cal phenomena in which the wave nature of light is manifested 
(for instance, interference, diffraction, polarization and disper- 
sion of light). Since light comprises electromagnetic waves, wave 
optics is based on Maxwell’s equations (27.4.3) and the rela- 
tions that follow from them for electromagnetic waves (31.1). 
Classical wave optics deals with media that are linear with 
respect to their optical properties (30.1.7), i.e. media in which 
the dielectric constant e, and the magnetic permeability u, are 
independent of the intensity of the light (31.2.4). Consequently, 
the principle of superposition (30.4.1) is valid in wave optics. 
Phenomena that are observed in the propagation of light in 
nonlinear media are investigated in nonlinear optics. Nonlinear 
optical effects become appreciable at the very high intensities of 
light radiated by powerful lasers (39.6.8). 

32.1.2 It has been experimentally established that the effect of 
light on a photocell, photographic film, fluorescent screen or 
other devices that record light is determined by the electric 
vector E of the electromagnetic field set up by the light wave. 
The same conclusion is reached by classical electronic theory, 
according to which the processes caused in matter by light are 
associated with the effect of the field of the light wave on charged 
particles of matter, i.e. on the electrons and ions. The frequency 
of visible and more short-waved light is so high (v > 1015 Hz) 
that only electrons can execute forced vibration anywhere near 
appreciable in amplitude. The force exerted by the electro- 
magnetic field (24.1.5) on an electron is 


F = —e {E + [v,B]} = —e {E + ypho [vH]}. 
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ilere —e and vı are the charge and velocity of the electron, 
and B == wppoll is the vector of magnetic induction. It follows 
from Sect. 34.1.5 that the absolute value of the magnetic com- 
ponent of force F is substantially less than its electrical com- 
ponent. Thus 


Hrhol [Vi] < urugy H = ILE <E and F ~ —eF, 


because the velocity of electromagnetic waves is v ~ 108 m/s, 
and the velocity of the electron in the atom upon forced vibra- 
lions set up by light is v, ~ 10° m/s. 

32.1.3 In superposing light from two nonlaser sources* (for 
instance, two identical gas-discharge lamps), or even from 
different portions of the same source, no interference (30.5.2) 
is observed. Consequently, independent sources of light are 
incoherent (30.5.1) and their radiation is nonmonochromatic 
(34.1.6). The reason for this resides in the very mechanism by 
which light is radiated by the atoms (or molecules or ions) 
of the source. The excited atom radiates during an extremely 
short de-excitation time, of the order of t ~ 10-8 s (31.3.4), 
after which it returns to its normal (unexcited) state, having 
expended its surplus’ energy on radiation. Following a certain 
period of time the atom may again be excited, obtaining energy 
externally, and begin to emit radiation again. Such intermittent 
radiation of light by atoms in the form of separate short im- 
pulses—wave trains, or packets—is typical of any light source, 
regardless of the specific processes that occur in the source and 
excite its atoms. In spontaneous radiation (39.6.6), the atoms 
emit radiation independently of one another with random initial 
phases that vary without any order from one event of emission 
lo another. Therefore, spontaneously radiating atoms are inco- 
herent sources of light. 

Induced, or stimulated, emission of radiation is quite 
another matter. It is produced in a nonequilibrium (active) 
medium by the action of a variable electromagnetic field (39.6.1). 
Stimulated emission of radiation by all the particles of a system 


* This refers to ordinary light sources (incandescent lamps, 
discharge lamps, electric arcs, etc.), based on the principle of 
spontaneous radiation (39.6.6). 
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is coherent with the monochromatic radiation that it is produced 
by, having the same frequency, polarization and direction of 
propagation. These features of stimulated emission of radiation 
are made use of in lasers and masers (39.8.8). 

32.1.4 A real wave, radiated during a limited period of time 
and encompassing a limited region of space, is not monochroma- 
tic. The spectrum of its angular frequencies (28.4.7) is of finite 
width Ao, i.e. it includes angular frequencies from © — Aw/2 
to œ + Aw/2. During the time interval At< Teon = M/A 
such a wave can be considered as an approximately monochroma- 
tic wave with the angular frequency ©. The quantity Teoh is 
called the time coherence of a nonmonochromatic wave. During 
the time interval Tcop the phase difference of the vibrations 
corresponding to waves of the frequencies œ + Aw/2 and o — 
Aw/2 changes by x. A wave with the angular frequency © and 
phase velocity v propagates during this time over the distance 
leoh = UTcoh = stvo/ Ao. 

The quantity leon is called the coherence length, or the harmonic 
wave train length, of the corresponding nonmonochromatic wave 
being considered. The closer a given wave is to a monochromatic 
one, the less the width Aw of its frequency spectrum and the 
greater its coherence time and coherence length. For example, 
for visible sunlight, having a continuous frequency spectrum 
from 4 X 10% to 8 X 10% Hz, Teoh =% 107 s and leoh S 
zæ 10-* m. The coherence time of stimulated radiation is sub- 
stantially longer than the de-excitation time of the atom (31.3.4). 
For example, tego, reaches 10-5 s for a continuous-wave laser 
and the coherence length Ico, ~ 103 m. 


32.2 Interference of Light. Spatial Coherence of Light 


32.2.4 To obtain coherent light waves by means of conventional 
(nonlaser) light sources, light from a single source is split into 
two or more systems of waves. Each system represents the 
radiation of the same atoms of the source. Hence, by virtue of 
their common origin, these systems of waves are coherent with 
one another and interfere when superposed. Light can be split 
into coherent systems of waves by its reflection or refraction. 

Illustrated as an example in Fig. 32.1 is the principle of the 
Fresnel mirrors. Light from point source S is incident on two 
plane mirrors A,O and A,O arranged perpendicular to the 
plane of the drawing and joined along line O. The angle a be- 


- 
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tween the planes of the mirrors is very small. Light from source S 
propagates after being reflected as two beams having their 
centres at points Sı and S, which 
are the virtual images of source 
S in the mirrors. These beams 
are coherent and upon being su- 
perposed produce an interference 
pattern on screen Æ (the region 
BC is sometimes called the field 
of interference). The result of the 
interference at some point M on 
the screen depends upon the wave- 
length À of the light and the path 
difference of the waves (30.5.2) from 
the coherent virtual sources Sı 
and S, to point M. Thus 


A = ra — rı = MS, — MS.. Fig 32.1 


The initial phases of vibration of sources S, and S, are the 
same. Hence, the conditions for interference maxima and minima 


(30.5.2) are of the form 

Tg — Ty = + mA for a maximum of the mth order 

me 0,4, 2); 

7 = +m = nå for a minimum of the mth order 


(m = 1, 2,...). 


The angle 2% at apex S between the two rays of light that con- 
verge at point M of the interference pattern after being reflected 
from mirrors A,O and 4,0 is called the aperture angle of the 
interference. This angle usually varies very little upon changing 
the position of point M within the limits of the field of inter- 
ference. 

32.2.2 The principles of the techniques used for observing inter- 
ference by means of the Fresnel biprism (Fig. 32.2) and the 
Billet split lens (Fig. 32.3) are similar to those of the Fresnel 
mirrors. The biprism consists of two identical triangular prisms 
joined together at their bases and prepared as a whole. The 
refracting anges œ at the upper and lower edges of the biprism 
are very small (of the order of a fraction of one degree). Light 
from source S is refracted in the biprism and propagates in it 
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in the form of two systems of waves, corresponding to the coherent 
virtual sources of light S4 and Sa. The interference of these waves 
is observed in the region of their overlapping on screen Æ. 

The Billet split lens consists of two halves L and L, of a collect- 
ing lens, cut along a diameter. The two halves are slightly 
displaced laterally (spread apart). In consequence, they produce 
{wo noncoinciding real images S$, and S, of the point source of 


Fig. 32.2 Fig. 32.3 


light S. The interference of light from these coherent secondary 
sources is observed on screen E£. The space between parts Z and 
L, of the split lens is covered by opaque screen A. 

The value of the aperture angle 2m of interference is shown in 
Figs. 32.2 and 32.3 for the central point Mọ of the interference 
pattern obtained by means of a biprism and a split lens. 
32.2.3 The width of an interference fringe (as the bands are 
called) is the distance between two adjacent interference maxima 
(or minima). When the Fresnel mirrors or similar devices (bi- 
prisms or split lenses) are employed to obtain interference, the 
width of an interference fringe (30.5.2) is AL/l. Here l is the 
distance between sources Sı and Sa, and L is the distance from 
them to screen E. The wavelength of visible light is extremely 
short (A œ% 5 X 10-7 m). Therefore, to obtain an interference 
fringe of a width that can be resolved by the eye, it is necessary 
to comply with the condition: 1< L. Consequently, the angle a 
in the Fresnel mirrors and the refracting angles a of the biprism 
must be very small. 


112] INTERFERENCE OF LIGHT 449 


The possibility of observing interference fringes also depends on 
lhe degree of contrast, i.e. on the degree of different illumination 
of the screen at the maxima and minima. The illuminance is 
proportional to the intensity J of the incident light. A quanti- 
tative characteristic of the degree of contrast of the interference 
pattern is the dimensionless quantity called the visibility of 
fringes 


V= Imax— Imin 
Imax t Imin ? 


where Imax and Imin are the values of the light intensities in the 
interference maxima and minima on the screen. The human eye 
reliably distinguishes fringes if their visibility V > 0.1, 
i.e. When Imin < 0.82 Imax- 

In superposing two likewise polarized coherent monochromatic 
waves, with the amplitudes and intensities A,, I4, A, and Io, 
the visibility of the fringes is 


24,4, _ 2V Il 
AtA lithe 


The visibility of the fringes is a maximum (V = 1) when A, = 


V = 


Qe 

32.2.4 In an interference arrangement of the type of the Fresnel 
inirrors, illuminated by a point source S (Fig. 32.1), the super- 
posed waves are never ideally monochromatic (32.1.3). Hence, 
these waves are only partly coherent. They are capable of inter- 
ference only under the condition that the vibration, excited by 
them at point M being considered on the screen, corresponds to 
one and the same harmonic train, or packet, of radiation of 
source S (32.1.4) i.e. if 


| ro — Ti | < VTeoh OF | r2 — 711 < leob. 


Here rg — rı is the path difference of the superposed waves, v is 
their velocity, and Teoh and leoh are the coherence time and 
coherence length of light from source S (32.1.4). Partly coherent 
vibrations are combined at point M. These vibrations are excited 
by the same source S at different instants of time, ¢ and ¢ + T, 
where t = | rg — rı |/v. Therefore, the visibility of the inter- 
ference pattern in such arrangements essentially depends on the 
time-coherence of the vibrations, which is limited by the mono- 
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chromaticity of the light from source S, i.e. its coherence time 
Teoh (82.1.4). At T< Teon, the superposed vibrations are practi- 


cally fully coherent and the visibility of the interference fringes 
(at equal intensity of the superposed waves) V ~ 1. If, however, 
TÆ Teoh, the superposed vibrations are incoherent and there 
is no interference (V = 0). 

Thus, to observe the interference of light at large path differ- 
ences rg — rı (and corresponding large values of t) it is necessary 
for the light to have a sufficiently long coherence time, i.e. a 
sufficiently high degree of monochromaticity. 

32.2.5 The positions on the screen of all the interference maxima, 
with the exception of the zero-order maximum, depend upon 
the wavelength of the light. For two wavelengths A, and Aa, the 
higher the order m to be observed, the more their maxima of 
the mth order are displaced from each other. Therefore, as m is 
increased, the visibility of interference fringes, obtained by the 
illumination of Fresnel mirrors with nonmonochromatic light, 
is impaired. Fringes, corresponding to different values of A, 
are superposed on one another, and the interference pattern is 
blurred. 

Assume that the wavelength of light is within the limits from A 
to A+ AA, and the angular frequency, from œ to œ — Aa, 
where Aw = 2xvAA/i2. Then, according to the Rayleigh criteri- 
on, the interference pattern is still distinguishable up to a maxi- 
mum of the mọ order for light with the wavelength À + AA 
(where AA > 0), which is superposed on the screen on the nearest 
interference minimum for light of wavelength A: 


Mg (à+ Ad) = (2mo + 1) + , from which img= oe 


Thus, interference can be observed at path differences of the 
waves that comply with the condition 


A2 [TW 
Ira —riıl <S FA = Ag = teon 


This result agrees with the assessment made in Sect. 32.2.4 on 
the basis of the concept of time coherence of vibrations. 

32.2.6 Partly coherent light, of a total intensity J, can be dealt 
with as the combination of two components: the coherent com- 
ponent of intensity yJ, where y is the degree of coherence of the 
light, and the incoherent component of the intensity (1—y) J. 


32.2) INTERFERENCE OF LIGHT | 4541 


When partly coherent waves are superposed, only their coherent 
components interfere. The incoherent components produce the 
uniformly illuminated background of the interference pattern. 
''herefore, as the degree of coherence of the light is reduced, the 
visibility V of the interference fringes deereases. Thus 


Y= 2V Ia Ye 
Iit 


When the intensities of partly coherent waves are the same, 
V = y. ; 
32.2.7 As a rule, in an interference device with mirrors (or 
a biprism), a brightly illuminated narrow slit, parallel to the 
joint O between the mirrors, is used instead of a point source of 
light. In this case, the interference patterns, obtained on the 
screen from the various portions along the length of the slit, 
are displaced with respect to one another in the direction of 
the slit S. Accordingly, a system of interference fringes, parallel 
to joint O of the mirrors, is observed on the screen. 

The visibility of the interference fringes is reduced as the width 
of the slit S is increased. This is due to the fact that the inter- 
ference fringes that could be obtained on the screen from various 
narrow slits into which slit S could be conceivably divided 
would be displaced with respect to one another. An interference 
pattern in monochromatic light of wavelength A is sharp and 
neta if the following approximate condition is complied 
with: 


b sin © < A/4, 


where 0 is the slit width and 2% is the aperture angle of inter- 
ference (32.2.1). 

32.2.8 The principle of Young's two-slit interference method is 
illustrated in Fig. 32.4. The source of light is a brightly illumi- 
nated narrow slit S in screen A;. Light from slit S falls on a sec- 
ond opaque screen A, having two identical narrow slits S; and 
Sg, parallel to slit S. Two systems of cylindrical waves are 
propagated in the space beyond screen Ag; their interference is 
observed on screen Æ. The visibility of the interference fringes for 
small path differences is determined mainly by the degree of 
agreement in the way the vibrations are executed at points of 


29% $ 
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slits S} and S». These slits can be considered as the “sources” of 
the waves interfering on the screen. 
32.2.9 The coherence of vibrations executed at the same instant 
of time at various points of a plane Q, perpendicular to the direc- 
tion of wave propagation, is called spatial coherence (as distin- 
guished from time coherence of 
vibrations, executed at one 
and the same point, but at 
different instants of time). 
Spatial coherence depends 
upon the conditions for the 
radiation and formation of 
light waves. For instance, a 
light wave emitted bya point 
source has complete spatial 
Fig§32.4 coherence. In the case of an 
eae ideal plane wave, the am- 
plitude and phase of the vi- 
bration are the same at all points of plane Q, i.e. there is also 
complete spatial coherence. Spatial coherence is also maintained 
over the whole cross section of a light beam emitted by a laser. 
In a real wave, emitted by a host of independent atoms of an 
extended nonlaser light source, the phase difference of the 
vibrations at two points K, and K, of plane Q is a random func- 
tion of time. The random variations of the phase difference in- 
crease with the distance between the points. The length of spatial 
coherence is the distance lge between points Kı and K, of plane 
Q at which the random changes of phase difference reach a val- 
ue equal to x. If in Young’s two-slit interference arrangement 
the distance l between slits S4 and S, is greater than or equal to 
lsc the visibility of the interference fringes equals zero. To 
ensure spatial coherence in the illumination of slits S, and Sz, 
the width b of inlet slit S should be sufficiently narrow. Thus 


b < ìàd/l and 0 < A/I, 


where d is the distance between screens A; and Az, and 0 = b/d 
is the angular dimension of the light source, i.e. slit S. 

The length of spatial coherence lsc = A/0 increases with the 
distance from the light source. For example, for a star of dia- 
meter D, located at the distance r, 0 = D/r and Ig, = Ar/D. 
The area of a circle of radius ig, is called the dimension of spatial 
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coherence, and the volume of a right cylinder having such a circle 
as its base and a height equal to length of the harmonic wave 
train (32.1.4) leoh = UTcop, is called the volume of coherence. 


32.3 Interference of Light in Thin Films 


32.3.1 Examples of light interference observed under everyday 
conditions are the iridescent colouring of thin films (soap bubbles, 
films of petroleum or other oils on the surface of water, trans- 
parent films of oxides on the surfaces of heat-treated metal com- 
ponents, called temper colours, etc.). The formation of partly 
coherent waves, displaying interference upon superposition, 
occurs in these cases due to the reflection of light, incident on the 
film, from its upper and lower surfaces. This interference depends 
upon the phase difference acquired by the superposed waves in 
the film and depending upon their optical path difference. 
32.3.2 The optical path, or distance, s of light is the product of 
the geometric path length Z travelled by light in the medium 
by the refractive index of the medium (31.5.1): s = nl. The 
quantity s is equal to the path travelled by light in vacuum 
during the time it covers the path J in the given medium. 

The optical-path difference of two waves is the difference in the 
optical path of these waves: As = są — sı (the optical-path 
difference is frequently denoted by A or 6). An optical-path 
difference As corresponds to a change in the phase by A® == 
== 20As/\), where A, = ndis the wavelength of the light in vacuum, 
À is the light wavelength in a medium with the refractive index n. 
Paths of propagation of waves, having the same optical length, 
are said to be tauiochronous. Light requires equal amounts 
of time to travel them. In an optical system (microscope, tele- 
scope, etc.), forexample, all possible paths of the rays of light 
from some point on the object to the corresponding point on its 
image are tautochronous. 

32.3.3 Assume that a plane monochromatic light wave 7 is 
incident at the angle i on a plane-parallel homogeneous plate of 
thickness d and transparent to light (Fig. 32.5). Owing to the 
reflection of light from the upper and lower surfaces of the plate, 
two plane waves propagate in the directions of the reflected rays 
I’ and 1”. Their optical-path difference is 


As=n (AB-++ BC)—n, (4D- =) = 2 dn Cos pe i 
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where n; and n are the refractive indices of the surrounding 
medium and of the plate, D is the foot of a perpendicular erected 
on ray 2’ from point C, member 4,/2 takes into account the 
phase difference of x as light is reflected from point A (341.5.5),* 
r is the refractive angle, and 
Ay is the wavelength of the 
light in vacuum. 

The conditions for interference 
maxima of reflection are 


2dn cos r = (2m-+1) oe or 


2d cos r=(2m-+1) = , where 


m=0, 1, 2,... is the order 
Fig. 32.5 of the interference maximum. 


The conditions for interference minima of reflection are 
2dn cos r = md, or 2d cos r = mA, 


where m = 1, 2, ... is the order of the interference minimum. 
The optical-path difference for the waves passing through the 
late (rays 2’ and 2”) is As = 2dn cos r, i.e. it differs from As 
or the reflected light by 4,/2. Therefore, the maxima of reflection 
correspond to the minima of transmission of light and vice versa. 
If the plate is illuminated with white light, it has additional 
colouring in the reflected and transmitted light. | 
The maximum thickness d of the plate at which it is still possi- 
ble to observe interference fringes is limited by the coherence time 
Tcoh Of the light (32.1.4). Thus 


2dn cos r < cTeop Or 2d cos r < leohs 


where lcoh = UTeoh = CTeoh/n is the coherence length. 
32.3.4 In the calculation of the optical-path difference of inter- 
fering waves in a plate (32.3.3) only two waves were taken into 


* It is assumed that i< igr and that n> nı. When 
n < nj, a phase difference of = occurs upon reflection of light 
from point B and As=2dncosr-+A,/2, i.e. it differs from 
the value of As for the case when n>n, by Ap, but this 
has no effect on the result of the interference. 
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necount. They correspond to the first reflection from the upper 
ind from the lower surfaces of the plate; the possibility of mul- 
tiple reflection of light was neglected. Such simplification is 
justified only ifthe intensity J, of the wave corresponding to the 
second reflection from the lower surface of the plate is substanti- 
ally less than the intensity I, of the wave corresponding to the 
first reflection. If R is the coefficient of reflection (31.5.6) from 
the upper and lower surfaces of the plate, then J, = R®J,. As 
n tule R2< 1. At the interface, for instance between air and 
lass (nai = 1.5), at angles 
of incidence i< 50°, the 
coefficient of reflection R < 
-< 0.05. In certain special 
cases, when J, is compar- 
able to J;, it is necessary 
to deal with multiwave inter- 
ference (32.4). 

32.3.5 In considering the 
interference of light in thin 
films, distinction is made 
between interference fringes 
of equal inclination (iso- Fie, 32.6 

clinic fringes) and those of EE 

equal thickness (isopachic 

fringes). Equal-inclination fringes are observed when a diverging 
(or converging) beam of light is incident at various angles i on 
n thin plane-parallel film. These, for instance, are the conditions 
that prevail when the film is illuminated by an extended source 
or scattered sunlight. Since d and n are the same all over the 
film, the optical-path difference of the interfering waves varies 
along the surface of the film only owing to the variation in the 
angle of incidence i. The conditions of interference are the same 
for all rays incident on the surface of the film and reflected from 
it at the same angle. Consequently, the interference pattern in 
this case is said to be made up of cqual-inclination fringes. 
Fqual-inclination fringes are observed on screen E arranged in 
the focal plane of a convergent lens L (Fig. 32.6). With no lens 
available the interference pattern could only be observed at 
infinity, at the point of intersection of the parallel rays 7’ and 
1”, 2’ and 2” etc. Therefore, equal-inclination fringes are said to 
be localized at infinity. For their visual observation it would be 
necessary for the eyes to accommodate to infinity. 
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32.3.6 Equal-thickness fringes are observed in reflecting a paral- 
lel or almost parallel beam of light rays (i = const) from a thin 
transparent film whose thickness d is not the same at various 
places. The optical-path difference of the interfering waves varies 
in going from certain points on the surface of the film to other 
points with a corresponding change in the film thickness d. Hence 
the conditions of interference are the same for points having 
the same values of d. This is why the interference pattern being 
considered is said to be made up of equal-thickness fringes. 
Equal-thickness fringes are localized close to the surface of the 
film. Hence, they ‘can be observed 
by accommodation of the eyes ‘prac- 
tically to the film’s surface. 

When light interferes in a thin trans- 
parent wedge with a small wedge 
angle a, equal-thickness fringes have 
the form of straight bands parallel 
to the edge of the wedge. If the wedge 
is illuminated by monochromatic light 
with the wavelength A, in’ vacuum, 
normally incident on the wedge sur- 
face (i = 0), the width of the inter- 
ference fringes (32.2.3) is equal to 
i,/2na, where nis the absolute re- 
fractive index of the wedge material. 
32.3.7 Equal-thickness fringes having 
the -form of concentric ‘rings, and 
called Newton's rings, are observed in 
the interference of light in the thin 
air clearance between a flat “glass 
plate A a a ee H L 
f P pressed tightly against the plate 
(Fig. 32.7). The flat surface of the lens is parallel to the plate 
surface and light is normally incident to the lens surface. The 
centre of Newton's rings coincides with point O of contact 
between the lens and plate. At a small distance r from point O 
the optical-path difference of the waves reflected from the upper 
and lower surfaces of the air clearance is 


Fig. 32.7 


32.4] MULTIWAVE INTERFERENCE 457 


where n ~ 1 (for air), and R is the radius of the convex surface 
on the lens. 

Note. In calculations concerning Newton's rings, the wave reflect- 
cd from the upper (flat) surface of the lens need not be taken 
into account because the optical-path difference between this 
wave and the waves reflected from the boundaries of the air 
clearance is greater than the coherence length for nonlaser 
light (32.3.3). 

In reflected monochromatic light with the wavelength in air 
à = y/n ~ do, the radii of the dark and bright Newton’s 
rings are 


—— RA 
Tdark>= V mRi and bright = V em + 1) —- ’ 


where m = 0, 1, 2, ... . The dark spot in the centre of the 
pattern corresponds to the change in the phase of the wave 
by x when it is reflected from the lower surface of the air clear- 
ance. If white light is incident on the lens, a central dark spot 
is observed in the reflected light, | 
surrounded by a system of col- 
oured rings. These rings correspond 
to the interference maxima of the 
reflection of light with various val- 
ucs of wavelength Aj. 


32.4 Multiwave Interference 


32.4.4. Special interference instru- 
ments, such as the diffraction grat- 
ing (33.3.4), the Fabry-Perot eta- 
lon and others are used to obtain Fig. 32.8 

interference of many light waves 

with close or equal amplitudes. 

The amplitude A of the resultant vibration and its intensity 
I = A? at arbitrary point M can be found by applying 
the method of vector diagrams for adding oscillations or vibra- 
tions having the same direction (28.4.2). | 
32.4.2 Shown in Fig. 32.8 is a vector diagram of the addition 
of vibrations in interferenceof N waves. At the point M being 
considered, these vibrations set up coherent vibrations in the 
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same direction with equal amplitudes A; = A, and a phase 
difference, independent of i, between (i + 1)th and ith vibra- 
tions. Thus @;,, (H) — ®; (t) = Apo. The amplitude of the 
resultant vibration is 


A=2 x 00; sin > , where a=2n—NAQ, 
and 00, = fi ® 
2| sin 22o 
2 
Hence, 
sin Aih ak sin? Napo 
A= 4; and I=J, ; 
sin 2g 2 sin? Ao 


where J, = A? is the intensity of the vibration set up at point M 
by each of the N interfering waves separately. 

32.4.3 The principal mazima in the interference of N waves 
(32.4.2) are at points M that comply with the condition: Apo = 
= + 2nn, where n = 0, 1, 2,... is the order of the principal 


-27e 0 O Rn AY, 
Fig. 32.9 


maximum. The amplitude and intensity of vibration at the 
principal maxima are: 


Amar = NA, and Imaz = Nj. 
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Interference minima (A = 0) comply with the condition: 
Ag) = +27p/N, where p may be any positive whole number 
except a multiple of N. 

The nature of the dependence of I/I; on Ag, (32.4.2) is shown 
in Fig. 32.9. Between each pair of adjacent interference minima 
there is a maximum, either a principal or a subordinate maxi- 
mum. At high NW values the intensities of the subordinate maxima 
are negligibly small compared to that of the principal maxima. 
The values Ao = + Cae + 2n/N) correspond to two minima 
that bound a principal maximum of the nth order. Therefore, 
the “width’ of the principal maximum, equal to 4x/N, is inverse- 
ly proportional to the nuinber N of interfering waves, and its 
intensity is proportional to N?. The 
fact that the interference pattern 
changes in such a manner as N is 
varied fully complies with the law of 
conservation of energy. The total tener- 
gy of vibration at all points of the 
screen on which the interference pat- 
tern is observed is proportional to N. 
32.4.4 If the number N of interfering 
waves (32.4.2) is increased indefinitely, 
and their amplitudes A; and phase __ 
differences Aq, are correspondingly re- Fig. 32.10 

duced so that NA; and N Ag, remain 

finite quantities, equal to A, and Ag, then, in the limit, the 
vector diagram (Fig. 32.8) assumes the form illustrated in 
Fig. 32.40. Vector A, the amplitude of the resultant vibration, 
closes the arc BC of a circle. The length of this arc is equal to Ag 
and its central angle < BOC = Ag. Therefore, the radius of 


the circle OB = A,/Ag, and the amplitude A and intensity J 
of the resultant vibration equal 


sin LL sin? a, 
A= Ao Ag and m ’ 
i 2 


where I, = Af, 
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Interference minima are at points of the interference pattern 
for which 


Ag = + 2mm (where m = 1, 2, ...). 


Interference maxima are at points for which Ag = +2hk,n, 
where m = 0, 1, 2,... is the order of the maximum. The values 


I 


-67 -x -er O On m br Ag 


Fig. 32.44 


of factor k,, are determined from the transcendental equation 
tan km = kmn. For the central, zero-order maximum kg = 0 
and Ag-=-0. The amplitude and intensity of vibration at a zero- 
order maximum are equal to A, and J,. For all other maxima 
(m> 1) it can be assumed approximately that i 


km = (2m + 1)/2 and Ag = + (2m + 1) x. 


Accordingly, the ratio of the intensities for the mth and zero- 
order maxima equals 


e ee 
L mpn" 
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TABLE 32.1 
Order of the maximum | 0 | 1 2 | 3 4 
Fond ta | 1 0.045 | 0.016 | 0.008 | 0.005 


This ratio decreases rapidly with an increase of m (Table 32.1). 
The shape of the curve showing the dependence of J on Aq is 
shown in Fig. 32.114. 


CHAPTER 33 DIFFRACTION OF LIGHT 
33.1 Huygens-Fresnel Principle 


33.1.1 If the position of the wavefront (30.2.3) is known at a cer- 
tain instant ¢ of time, as well as the velocity v of the wave, the 
position of the wavefront at the subsequent instant t +- Az can 
he determined on the basis of the Huygens principle. According 
to this principle, each point on the surface S (t), through which 
the wavefront passes at the instant ¢ can be regarded as a source 
of secondary waves, and the sought-for position S (¢-+ At) of 
the wavefront at the instant of time t -- At coincides with the 
envelope of all the secondary waves at this later time. It is 
assumed here that in a homogeneous medium the secondary waves 
radiate only forward, i.e. in directions making acute angles 
with the outward normals to the wavefront. In a homogeneous 
isotropic medium, the secondary waves are spherical (Fig. 33.1). 
33.4.2 The Huygens principle can be used to derive laws for the 
reflection and refraction of light at the interface between two 
media. The line MN in Fig. 33.2 represents the plane interface 
between two media in which the velocities of light equal v, 
and va. Incident on this interface at the angle i is a plane wave 
(rays 7 and 2). At the instant ¢ of time the wavefront (plane 
AB) reaches the interface at point A. Hence point A begins 
to radiate secondary waves, propagating both in the first medium 
(reflected wave) and in the second medium (transmitted wave). 
During the time Az required for the incident wave to travel the 
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TT SY 7 tl, 


distance BC (wher At = BCi/vq), the front of the secondary 
wave, radiated fr. ı point A, reaches points of a semisphere in 


the first medium having the radius Rj = vj;At = BC, and points 
of a semisphere in the second medium having the radius R = 


= v,At = v,BCluy. The front of the reflected wave (rays 7’ 


VAL vAt 


A 
we 
a 
S9 
we 
PS 
=e 
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S(t SÉ tdt 
Fig. 33.1 Fig. 33.2 


and 2’), propagating at the angle i’ of reflection is the plane DC, 
tangent to a sphere of radius Ry with its centre at point A. 
Correspondingly, the front of the transmitted (refracted) wave 
(rays i” and 2"), propagating at the refraction angle r, is the 
plane CE, tangent to a semisphere of radius R, with its centre 
at point A, The law of light reflection: i = i’ follows from the 
equality of triangles ACD and ACB. The law of light refraction 
follows from the rectangular triangles ACB and ACE, which 
have a common hypotenuse. Thus 


sini BC _ vu _ 

sinr JZE De 

where n.p is the relative refractive index of the second and 
first media (34.5.4). 

33.1.3 The Huygens principle is a purely geometrical one. It 
indicates no method of calculating the amplitude of the wave 


that is an envelope of the secondary waves. Therefore, the 
Huygens principle is insufficient for deriving the laws of propaga- 
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tion of light waves. An approximate method of solving this 
problem, which is a development of the Huygens principle on 
the basis of the idea proposed by Fresnel on the coherence of 
secondary waves and their interference upon superposition is 
called the Huygens-Fresnel principle. This principle can be 
expressed by the following series of propositions: 

(a) In calculating the amplitude of light vibration, set up by 
source So, at arbitrary point M, the source can be replaced by 
a system of secondary sources. These sources are small portions 
ds of any closed auxiliary surface S arranged so that it encloses 
the source Sọ, but does not enclose point M being considered. 
(b) The secondary sources are coherent with Sọ and with one 
another. Hence, the secondary waves they set up interfere upon 
superposition. Interference calculations are simplest when S 
is the wavefront for the light from source Sọ because, in this 
case, the phases of vibration of all secondary sources are the 
same. 

(c) The amplitude dA of the vibration set up at point M by 
a secondary source is proportional to the ratio of the area ds 
of the corresponding portion of wavefront S to the distance r 
from this portion to point M and depends upon the angle a 
between the outward normal to the wavefront and the direction 
of a line from the element ds of area to point M. 

Thus 

dA =- f (œ) 


ads 
r A 

where a is a quantity proportional to the amplitude of the pri- 
mary wave at points of the element ds of area; f (a) decreases 
monotonically from 1 at a=0 to0at a> n/2 (secondary 
sources do not radiate backward)*. 

(d) If part of surface S is an opaque screen, the corresponding 
secondary sources (closed by the screen) do not radiate, whereas 
the rest radiate in the same way as in the absence of screens.** 


* As shown by Kirchhoff, f (a) œ (1 + cos æ), i.e. the function 
becomes zero only at a = x; at small angles of diffraction a, 
however, this more exact relation is of no significance. l 
** Actually, the material of the opaque screen has an effect 
on the radiation of the open secondary sources that are close to 
the edge of the screen (at a distance of the order of the wavelength 
of the light). 
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33.1.4 On the basis of the Huygens-Fresnel principle, the law 
of rectilinear propagation of light in a homogeneous medium can 
be substantiated from the wave point of view. Assume that Sọ is 
a point source of monochromatic light (Fig. 33.3) and M is the 
point of observation. As the auxiliary surface S, a spherical 
surface of radius R is chosen so that the distance L from point M 


to this sphere (L = OM) is of the order of R. Next, surface S 


Fig. 33.3 


is divided into annular portions of small area, called Fresnel 
one as shown in Fig. 33.3, where À is the wavelength of the 
ight. 

Vibrations set up at point M by two adjacent zones are opposite 
in phase, because the path difference from like points of these 
zones to point M is equal to A/2. Consequently, the amplitude of 
the resultant vibration at point M is equal to A = A, — A, 

+ Á —A,-+..., where A; is the amplitude of vibration set 
up at point M by secondary sources, located within the limits of 
the ith zone. At i< L/i, the areas of all the zones are the same: 
o; = RLA (R + L), and very small (at R = L = 10 cm and 
à = 5 X 10 cm we obtain o; ~ 8 X 10-5 cm’). With an 
increase in i, the distance r; from the zone to point M also 
increases, as does angle œ; between the normal to the surface of 
the zone and the direction of a line from the zone to point M. 
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Therefore, according to the Huygens-Fresnel principle, 4, > 
> Aa > Åz >... and A; Z (Ai + A;j41)/2. Consequently, 
the amplitude of vibration at point M is equal to A x A,/2, 
i.c. the resultant effect of the whole open wavefront is equal to 
one-half of the effect of the first (central) Fresnel zone, whose 
radius is very small. Thus, it can be assumed that practically 
light is propagated from Sọ to M in a straight line. 

33.4.5 If an opaque screen, covering all, except the first, Fres- 
nel zones for the point M of observation is placed in the path of 
monochromatic light from source Sj, the amplitude and intensi- 
ty of light at point M are doubled and quadrupled, respectively, 
compared to their values in the absence of the screen: A =A, 
and J = A2. Light can be intensified to a substantially greater 
extent by means of a zone plate. This is a glass plate to which an 
opaque coating has been applied in the form of rings which 
enclose only the even (or only the odd) Fresnel zones, i.e. every 
other zone. A zone plate has the same effect on light as a con- 
verging lens. 


33.2 Fresnel Diffraction 


33.2.1 The diffraction of light is the whole set of phenomena due 
to the wave nature of light and observed in its propagation in 
a medium with sharply defined optical inhomogeneities (for 
example, in passing through apertures in opaque screens or 
close to the boundaries of opaque bodies). In its narrower sense, 
diffraction refers to the bending of light around small opaque 
obstacles, i.e. deviations from the laws of geometrical optics: 
Distinction is made between two kinds of light diffraction: 
Fresnel diffraction, or diffraction with convergent rays, and Fraun- 
hofer diffraction, or diffraction with parallel rays. Inthe former, 
a spherical or plane wave is incident on the obstacle and the 
diffraction pattern is observed on a screen at a finite distance 
behind the obstacle. In the latter, a plane wave is incident on 
the obstacle and the diffraction pattern is observed on a screen 
located in the focal plane of a converging lens placed in the 
path of the light that passes by the obstacle. In Fresnel diffrac- 
tion, a “diffraction image” of the obstacle is obtained on the 
screen; in Fraunhofer diffraction, the “diffraction image” is of the 
distant light source. | 

33.2.2 In the simplest Fresnel diffraction problems, the diffrac- 


30-0186 
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tion pattern can be investigated by using the method of Fresnel 
zones (33.1.4). 

Example 1. Fresnel diffraction by a small circular aperture in an 
opaque screen AB (Fig. 33.4). When the aperture is illuminated 
by monochromatic light of wavelength A, a system of alternat- 
ing dark and bright interference rings is observed on screen Æ, 
which is parallel to AB. These rings have a common centre at 
point O, located opposite the centre of the aperture. If for point O 


d 
S 
A B 
| på 
2 
+25 
£E g 
Fig. 33.4 Fig. 33.5 


an even number 2k (where k = 1, 2,.. -) of Fresnel zones fit into 
the aperture, there is a dark spot at point O; the amplitude of 
light at point O is less than in the absence of a screen: 


4 
Aw > (Ay —Agk) < 41/2, 


where A, is the amplitude corresponding to only the ith zone. If 
the number of zones is odd (2k -+ 1), there is a bright spot 
at point O and 


Aw + (Ay + Agha1) > A,/2. 


If the aperture is illuminated by white light, a system of con- 
centric coloured rings is observed on screen E. 

The number of Fresnel zones that fit into the aperture and the 
degree of contrast of the interference pattern depend upon the 
ratio of the diameter d of the aperture to the distance 1 between 
screens AB and E. As d/l is increased, the amplitude of light at 
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the centre of screen E approaches A,/2 and the degree of contrast 
of the interference rings is reduced. 

33.2.3 Example 2. Fresnel diffraction by a small disk (opaque 
round screen). The method of constructing open Fresnel zones 
on the wavefront S of the incident monochromatic spherical 
wave is illustrated in Fig. 33.5. The interference pattern on 
screen Æ consists of alternating bright and dark rings with their 
centre at point O, where the interference maximum (Poisson's 
spot) is always observed. The amplitude of light at point O is 
one-half of the amplitude A,, corresponding to the effect at this 
point of only the single first open Fresnel zone: A = A,/2. 
When the disk is illuminated by white light, a white spot is 
observed at the centre of the screen. This spot is surrounded by 
a system of concentric coloured rings. 

As the ratio of the diameter d of the disk to its distance J from 
screen Æ is increased, the brightness of Poisson’s spot is gradually 
reduced and the next dark ring becomes broader, forming a re- 
gion of shadow beyond the disk. 


33.3 Fraunhofer Diffraction 


33.3.1 Example 1. Diffraction of light by a narrow long slit 


in an opaque screen (Fig. 33.6). The slit width BC = b, and its 
length in the direction perpendicular 

to the plane of :the figure is lẹ b. 
Light is normally incident on the 
surface of the slit so that vibrations 
at all points of the slit are in phase. 
The diffraction pattern is observed on 
screen Æ, which is placed in the focal 
plane of converging lens L. Parallel 
rays BM and CN, travelling from the 
edges of the slit at the diffraction 
angle p to the direction of incident 
light, are collected by the lens at its 
secondary focus Fy. The lens are made 
so that the paths of the rays of light 
BMF, and DNF, where D is the Fig. 33.6 

foot of the perpendicular dropped 

from point B to ray CN, are tautochronous paths (32.3.2). 
Therefore, the result of the interference at point Fy of the screen 


30% 
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depends upon the path difference CD = b sinp and the wave- 
length à of the light. 

(a) Approximate solution. The slit can be divided in width into 
Fresnel zones, having the form of strips parallel to edge B, such 
that the path difference from the edges of the strips is equal 
to A/2. The number of such Fresnel zones that fit into the slit 
is equal to 2b/siny|/A. All the zones radiate light entirely 
equally in the direction being considered; and the vibrations 
set up at point Fy by two adjacent zones are equal in amplitude 
and opposite in phase. Therefore, when there is an even 
number of zones: 


b sin p= + 2m +, where m=1, 2, ..., 


and a diffraction minimum is observed (complete darkness). 
When the number of zones is odd: 


b sin spa sk (2m41) >, where m=1, 2, ..., 


and a diffraction maximum corresponding to one Fresnel zone, 
is observed. The brightest central maximum is observed at the 
principal focus Fy of the lens (p = 0). With an increase in m the 
width of the Fresnel zones and the intensity of the maxima are 
rapidly reduced. 

(b) Exact solution. The slit is divided into a very great number 
of identical very narrow strips, parallel to edge B. Secondary 
waves radiated by these elements of the slit, set up vibrations at 
point Fy that have the same small amplitudes, and their 
initial phases continuously fill an interval of width Ag = 
= (2nb sin w)/A. According to Sect. 32.4.4, the amplitude and 
intensity of light at point Fẹ, equal: 


sin mb ony sin? ub sin Ņp 
Ay=49|——“aosingg | t fe=lo asin 2 
A ae ae 


where A, and J, are the amplitude and intensity at the central 
maximum (wp = 0). 
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The condition for diffraction minima of light is the same as in 
the approximate solution: b sin w= +2mA/2, where m = 
=1,2,.... The condition for diffraction maxima is of the 
form 


nmbsin — xh siny 


tan 7, 7 


and it differs only slightly from the condition obtained by means 
of the Fresnel zones method. 

33.3.2 If white light, instead of monochromatic light, falls on 
the slit, the central maximum is white with iridescent colouring 
along its edges. All the other interference fringes are coloured 
because different angles ıp and different points Fẹ on the screen 
correspond to both minima and maxima of the same orders m, 
depending upon the wavelength Aj. 

As the width b of the slit is reduced, the width of the central 
maximum is increased because of the increase of angles p = 
‘= +arc sin (A/b), which correspond to minima of the first order, 
limiting the central maximum. At b < A the illumination of the 
screen is reduced monotonically from the centre (point Fo) 
toward the edges. When the slit is very wide (b> 4), a bright 
and sharp image of the light source, formed by lens Z according 
to the laws of geometrical optics, is seen on the screen. 
33.3.3 Example 2. Diffraction of light by a circular aperture. 
A plane monochromatic light wave is normally incident on the 
aperture, i.e. perpendicular to the plane of the aperture. The 
diffraction pattern is observed in the focal plane of a converging 
lens arranged beyond the aperture so that its optical axis is 
perpendicular to the plane of the aperture. The diffraction pattern 
has the form of a bright white spot at the principal focus F, 
of the lens and, concentric about this spot, alternating dark and 
bright rings. The intensity of the bright rings is very low com- 
pared to that J, of the central maximum, and decreases with an 
increase in their radius. For instance, the intensity of the nearest 
maximum of the first order to the central one is 7, < 0.02 Iq. 
The diffraction angle »,, corresponding to the first dark ring, 
bounding the central maximum, complies with the condition 


À 


sin p; 1.22 J?’ 
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where D is the diameter of the aperture and À is the wavelength 
of the light. 
If light falls on the aperture at a small angle «œ to the normal to 
the poe of the aperture, the interference pattern remains prac- 
tically unchanged, but its centre is displaced to a secondary 
focus of the lens, corresponding to the angle p = a. 
33.3.4 Example 3. Diffraction of light by a one-dimensional 
grating. A one-dimensitnal diffraction grating is a system of 
ooo T ooe a large number N of parallel 
slits in a screen, the slits being 
' igi | e} TA Ta y identical in width and separated 
SOA by opaque strips, also of equal 
width. Shown in Fig. 33.7 are 
only two adjacent slits of the 
grating. The quantity d=a+ b, 
where a = CD is the width of an 
opaque strip between two slits 
and b = BC is the slit width, is 
called the diffraction grating con- 
stant, or interval. 
E In analysing the diffraction pat- 
tern on screen Æ, mounted in 
the focal plane of converging 
Fig. 33.7 lens L, it is necessary to take 
into account the interference of 
the secondary waves not only 
from different portions of one slit, but from the different slits 
as well. If a plane monochromatic wave is normally incident 
on the grating, the vibrations at all points of the slits are in 
hase. 
Vibrations set up at the arbitrary point fF, of the focal plane 
of lens L by each of the slits coincide in amplitude (A,) and 
differ in phase. For each pair of adjacent slits the phase differ- 
ence Ag, between these vibrations is the same. It depends upon 
the wavelength A and the path difference from like points of the 
slits (for instance, points B and D), i.e. on the length KD = 


= d sin p, where K is the foot of the perpendicular, dropped 
from point B to the ray DN. Thus 


AG) = = sin p. 
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Therefore, according to Sects. 32.4.2 and 33.3.1, the amplitude 
and intensity of the resultant vibration at point F, equal 


és nb sin Yọ ia nNd sin p 


N x 
A= Ao | ae sin - ndsmý |’ 
‘a N 
sin? nb sin Yp sin? ne ap 
T= Io 


mb sin p \2 . 4 Ndsinyp ? 
( x ) sin 7 
where A, and J, are the amplitude and intensity of the vibra- 
tion at point Fo (i.e. at p = 0) due to the effect of a single slit. 
33.3.5 The principal minima in the diffraction of light by 
a diffraction grating occur at diffraction angles wp, corresponding 
Ci minima in diffraction by a single slit (33.3.1). 
Thus 


b sin p = +m (m = 1, 2, ...). 


In these directions none of the slits provide light (“each slit 
extinguishes itself”). 

The principal maxima (32.4.3) correspond to diffraction angles p 
that comply with the condition 


d sin p= + na, 


where n = 0, 1, 2, ... is the order of the principal maximum. 
Note. If certain values of p simultaneously comply with the 
conditions for both principal maxima and principal minima, 
then the principal maxima corresponding to these values of p 
are not observed. For example, if d = 2b, then all the even prin- 
cipal maxima (n = 2, 4, 6, etc.) are absent. 

The intensity of a principal maximum of the nth order is 


Nd \2 nnb 
=z uae n2 
= sin? —— , 
33.3.6 Between each two principal maxima, there are N — 1 
additional minima that comply with the condition 


oot, eet. ag A 
d sin t= : 
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where p may be any positive whole number with the exception 
of N, 2N, 3N, etc. Accordingly, there are N—2 additional max- 
ima whose intensity is negligibly low compared to that of 
the principal maxima. | 

The angular “width” of a principal maximum of the nth order, 
i.e. the difference between the values of the angles correspond- 
ing to the additional minima bounding this maximum, is 


2X 2A 


Atbn Ndcost, Leos, ’ 
where pn = arc sin (nA/d), and L = Nd is the length of the 
diffraction grating. For principal maxima of not particularly 
pel a angles ip, are small and cos pp ~ 1, so that Apn © 
x 2K L. 
33.3.7 With monochromatic light the diffraction pattern on 
screen Æ in the focal plane of Jens L (Fig. 33.7) has, at large N 
values, the form of narrow and 
bright principal maxima sepa- 
rated by practically dark and 
wide intervals. If the ratio d/b 
is an irrational number, the in- 
tensities of the principal maxi- 
ma (33.3.5) decrease monotoni- 
cally with an increase in the 
order n of the maximum. 
When the grating is illuminated 
by white light, an uncoloured 
central zero-order maximum is 
observed on the screen. On both 
sides of this maximum, diffrac- 
Fie. 33.8 tion spectra of the íst, 2nd, etc. 
a oy, orders are observed. These spec- 
tra appear as iridescent bands in 
which there is a continuous transition in colour from blue- 
met at the inner edge of the spectrum to red at the outer 
edge. 
33.3.8 Upon inclined incidence of light on a diffraction grating 
(Fig. 33.8), the condition for the principal maxima is of the 
orm 


d (sin p — sin?) = tna (n=O, 1, 2, ...), 
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where i is the angle of incidence of light on the surface of the 
grating. Frequently, the directions of the rays of light falling 
on the grating and diffracted from it are specified by the angles 
a, and a which these rays make with the axis OX of coordinates, 
arranged in the plane of the grating, perpendicular to the slits. 
In this case, the condition for the principal maxima can be 
rewritten in the form 


d (COS & — COS Qo) = + nA. 


33.3.9 Two screensare said to be complementary if the apertures 
in one of them exactly coincide in shape, size and relative posi- 
tion with the opaque areas in the other screen, and vice versa. 
Complementary, for example, are an opaque screen in the 
form of a circle (disk) of radius R and an opaque screen having 
a circular aperture of the same radius R. On the basis of the 
Ifuygens-Fresnel principle, it is possible to prove Babinet’s 
theorem (also called Babinet’s principle). This theorem states 
that in Fraunhofer diffraction by any screen, the intensity of 
diffracted light in any direction, except that of the propagation 
of a plane wave falling on the screen, should be the same as in 
diffraction by the complementary screen. 

33.3.10 Example 4. Diffraction by a large number of identical 
and identically oriented obstacles. The light intensity J at an 
arbitrary point M of the diffraction pattern, like in diffraction 
by a one-dimensional grating (33.3.4), can be presented in the 
form: J = fI. Here J, is the intensity at point M upon diffrac- 
tion of the same incident plane wave by a single obstacle. The 
function f depends only upon the number and relative arrange- 
ment of the obstacles. 

If the obstacles are arranged quite randomly with respect to one 
another and their number N is large, then f ~ N. In this case, 
the distribution of light intensity is the same as in diffraction 
by a single obstacle. But the intensity at each point of the 
diffraction pattern is N times as much. For example, in diffrac- 
tion of light by a glass plate coated with spherical dust parti- 
cles of diameter d, a system of bright concentric interference 
rings is observed. The sizes of these rings correspond to the 
diffraction of light by an opaque disk of diameter d or, in accor- 
dance with Babinet’s principle (33.3.9), by a round aperture of 
the same diameter, | | 
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33.4 Diffraction by a Space Lattice 


33.4.1 A space, or three-dimensional, diffraction grating is an 
optically nonhomogeneous medium in which the inhomogeneities 
are periodically repeated in varying all threespatial coordinates. 
An example of a space diffraction grating is the crystal lat- 
tice of a crystalline solid. Particles at the lattice points (atoms, 
molecules or ions) serve as orderly arranged centres that coher- 
ently scatter light falling on them. Assume that dı, d and d3 
are the lattice constants along the three coordinate axes £, n 
and ¢ that are oriented along three edges of the lattice intersect- 
ing in one of its points. Then in Fraunhofer diffraction (33.2.1), 
the principal maxima comply with the Laue conditions. 


dı (cos & — CoS Qo) = nA, d, (cos B — cos Bo) = nÀ and 
dz (cos y — COS Yo) = NA. 


Here &o, By and y,,and g, P and yare the angles between the coor- 
dinate axes —, ņ and C, and the directions of propagation of the 
incident and diffracted rays, respectively, nı, nọ and ng are 
whole numbers determining the order of the maximum, and A is 
the wavelength of the light. The Laue conditions follow from the 
relations (33.3.8) for diffraction maxima in inclined incidence of 
light on a one-dimensional diffraction grating. 

33.4.2 Of the three angles a, R and y (and likewise for angles 
Œo, Ro and yo) only two angles are independent because they 
must comply with one geometric relationship whose specific 
form depends upon the angles between the coordinate axes £, n 
and ¢. For example, if the coordinate axes are mutually per- 
pendicular, i.e. if the lattice is orthogonal, the geometric rela- 
tionship between a, B, and y is of the form: 


cos?a + cos?p + cos*y = 1. 


Therefore, with an arbitrarily given direction of incidence on 
a space lattice of monochromatic light with a given wavelength À, 
it is impossible, in general, to find values of a, B and y that 
simultaneously comply with the geometric relationship and 
Laue’s three conditions. The only exception is the zero-order 
maximum (nı = na = ng = 0), for which a = æa, B = Bo 
and y = Yo- 

In order to observe a diffraction maximum of the order (n,, nə 
and ns) at given values of the angles a», §, and yo, it is necessary 
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for the incident light to have a certain definite wavelength. 
lor instance, in the case of an orthogonal lattice, the wavelength 
should be 


n n n 
— | cos a)-+-—* cos Bo H — cos yo 
dy ds ds 


ty)? zay ( z)" i 
( di ) + do a ds 
If the wavelength A of the incident light is a fixed value, the 
Laue conditions and the geometric relationship between anglesa, 
B and y can simultaneously be complied with by a suitable 
choice of the direction of e incidence on the diffraction 
grating, i.e. of angles a, Bo and yo- 
33.4.3 It follows from Laue’s conditions that at A> 2dmay, 
where dmax is the maximum of the values of dı, da < and ds, 
there should be no maxima, except the zero-order one (nj = ng = 
= ng = 0). Light of such wavelengths propagates in a medium 
“without noticing” its inhomogeneities, i.e. without undergoing 
(liffraction. Hence, the condition A> 2dmax is called the 
condition for optical homogeneity of the “medium. 
The crystal lattice constants of solids are considerably Shortt 
than the wavelengths of visible light (d; ~ 5 X 10-1 m, where- 
as Avis) %5 X 1077 m). Hence, crystals are optically homo- 
yreneous media for visible light.* ‘At the same time, crystals are 
natural diffraction lattices for X rays (31.4.4). 
33.4.4 The diffraction of X rays by crystals can be interpreted as 
the result of interference of X rays that are mirror reflected from 
a system of parallel planes passing through the crystal lattice 
points. They are called planes of atoms, or lattice planes, of the 
crystal, The distance d between two adjacent lattice planes is 
called the interplanar spacing, or distance, and the angle O be- 
(ween the incident ray and the lattice plane (Fig. 33. 9) is called 
the grazing angle. The path difference of rays reflected from two 


adjacent lattice planes** is A = BC + BD = 2d sin 0. There- 


À = —2 


* Molecular scattering of visible light (34.3.1) is possible in 
crystals. It is associated with the impairment of their optical 
homogeneity due to fluctuations in density. 

** X rays arc not refracted in crystals becanse the values of the 
refractive index (31.5.1) of all crystals for electromagnetic radi- 
ition of such high frequency are practically equal to unity. 
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fore, according to Sect. 32.2.1, reflection is observed only in the 
directions, corresponding to diffraction maxima that comply 
with the Wulff-Bragg condition: 


2d sin 0 = mi, 


where m = 1, 2, ... is the order of the diffraction maximum. 
33.4.5 The experimental method of investigating the atomic 
structure of matter by analysing the diffraction of X radiation 
as it passes through the sub- 
stance being investigated is 
called X-ray structure analy- 
sis. This method proves most 
efficient in studying the 
structure of crystalline bod- 
ies. The diffraction pattern, 
registered on photographic 
film, is called the X-ray photo- 
graph, or pattern, of the speci- 
men. An X-ray photograph, 
obtained in the diffraction of a beam of “white” X radiation 
(with a continuous frequency spectrum) by a monocrystal is 
called a Laue diffraction pattern. It consists of discrete diffraction 
(Laue pattern) spots whose position is determined by the Laue 
conditions (33.4.1). A Debye-Scherrer-Hull powder photograph, 
or X-ray powder photograph, is an X-ray photograph obtained in 
the diffraction of a beam of monochromatic X radiation by 
a polycrystalline specimen (for instance, a crystalline powder). 
Such a photographic pattern consists of a system of concentric 
diffraction rings. The radii of the rings r = l tan 20, where l is 
the distance from the specimen to the plane of the X-ray powder 
photograph, perpendicular to the incident ray, and angles 6 
comply with the Wulff-Bragg condition (33.4.4). 


Fig. 33.9 


33.5 Resolving Power of Optical Instruments 


33.5.1 The image of the object in any optical system (telescope, 
microscope, photographic camera, etc.) is obtained by means of 
a restricted beam of light admitted into the instrument by the 
so-called aperture diaphragm. Such a diaphragm may be that of 
a photographic camera, lens mount of a telescope, etc. Reducing 
the diameter of the aperture diaphragm helps to decrease vari- 
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ous distortions of the image resulting from the use of wide beams 
of light and called geometrical aberrations of the optical system. 
But, owing to diffraction of light in an optical instrument, the 
image of a bright point is not a point, but a bright spot sur- 
rounded by a system of concentric interference rings (alternating 
dark and bright ones in the case of monochromatic light and 
iridescent in the case of white light). This phenomenon limits the 
resolving power (resolution) of the optical instrument, i.e. its 
capacity to provide separate images of two points of the object 
close to each other. | 

33.5.2 According to Rayleigh’s criterion, the images of two iden- 
tical point sources of light can still be seen separately if the 
central maximum of the diffraction pattern of one falls exactly 
on the first minimum of the diffraction pattern of the other. It 
follows from Sect. 33.3.3 that in accordance with the Rayleigh 
criterion, two close stars, observed in a telescope with mono- 
chromatic light of the wavelength A, can be resolved (viewed 
separately) if the angular separation (distance) between them is 


Ag > 1.22 A/D, 


where D is the diameter of the objective. The quantity (Ag), = 
= 1.22 X/D is called the angular resolution limit of the telescope. 
The reciprocal quantity 1/(Aq), is called the resolving power of 
ihe telescope. The resolving power of a telescope increases in 
proportion to the diameter of its objective. The condition for 
the resolution of'a terrestrial telescope and a photographic came- 
ra in viewing and photographing distant items coincides with 
that for an astronomical telescope. 

The angular resolution limit of the eye is determined by the 
diffraction of light by the pupil (D ~ 2 mm) and the granular 
structure of the retina. This limit is about one minute of arc. 
33.5.3. The resolving power of a microscope is characterized by 
the quantity (Al),, the shortest distance between two points of 
the object that are separately visible on the image. In the case 
of a self-luminous object, all of whose points can be assumed to 
be incoherent sources, 


0.619 

A 9 
where A, is the wavelength of the light in vacuum, A = n sin u 
is the numerical aperture of the objective, n is the refractive 


(Al) = 
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index of the medium between the object and the objective, and 
u is one half of the aperture angle of the beam of light issuing 
from a point of the object and entering the microscope objective 
(i.e. the angle between the outermost ray entering the objective 
and the optical axis). For non-self-luminous objects, the value 
of (Al), depends upon the illumination conditions. But in this 
case as well, (Al), >A,/A. 

The resolving power of a microscope can be improved either 
by reducing the wavelength ào, or by increasing the numerical 
aperture A (numerical aperture is abbreviated N.A.). The for- 
mer method is employed in ultraviolet and electron microscopy, 
and the latter, in the immersion microscope, in which the space 
between the objective and the object is filled with a transparent 
liquid having a refractive index n> 1. 


33.6 Holography 


33.6.1 Holography is a technique for obtaining a three-dimen- 
sional optical image of objects and is based on the phenomenon 
of wave interference. In contrast to the conventional photographic 
method, light-sensitive fine-grained photographic emulsion is 
used to record the relations, not only between the amplitudes 
(or their squares, i.e. the intensities) of light waves scattered by 
various small portions of the object’s surface, but also between 
the phases of these waves. 

The essence of the holographic method is made clear in 
Fig. 33.10. Photographic plate P (Fig. 33.102) records the 
interference pattern obtained in superposing wave 7, scattered 
by the object Q and called the signal wave, or object beam, and 
wave 2, coherent with wave 7 and having fixed values of ampli- 
tude and phase. Wave 2, called the reference wave, or reference 
beam, is emitted by the same source of light that illuminates the 
object and, after reflection from mirror B, falls directly on the 
photographic plate P. The interference pattern recorded on the 
photographic plate after it is developed is called the hologram of 
object Q. It consists of a very fine and intricate pattern of alter- 
nating interference maxima and minima in the blackening of 
the photographic emulsion and, in contrast to a photographic 
image, has no outward resemblance to the object. 

Holograms are obtained by the interference of light at large path 
differences, i.e. they require an extremely high degree of coher- 
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ence of the light (32.2.4). For this reason lasers (39.6.8) are used 
in holography as the light source. 

33.6.2 Reconstructing the image of the object from its hologram C 
(Fig. 33.106) is accomplished by passing reference wave 2, 
from the same laser used to make the hologram, through it as 
in the projection of a slide (diapositive). In this operation, the 
orientation of the plate with the hologram with respect to the 


Fig. 33.40 


reference wave should also be maintained. Wave 2 is diffracted 
by the hologram. As a result of this diffraction, two three- 
dimensional images of the object, a virtual and a real image, are 
seen. The virtual image Q’ is at the same place, with respect to 
the hologram, where the object Q was in taking the photograph. 
This image is visible in looking through the hologram as if it 
was a window. The real image Q” is on the other side of the 
hologram. It seems to be in the air in front of the hologram and 
is a mirror image of the object. 
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Ordinarily, the virtual image of the object is used; in visual 
perception it is identical to the object itself. It not only has 
the property of being three-dimensional, but its perspective va- 
ries with the position of the eye of the observer with respect to 
the hologram. For example, by moving the head along the 
hologram, it is possible to see behind some item located in the 
foreground of the holographic image. 

33.6.3 The interference pattern at each point of the hologram is 
determined by light scattered by all the points of the object. 
Hence, any portion of the hologram contains information 
about the whole object and can enable the image of the whole 
object to be restored if, in damaging the hologram, only this 
single portion remains intact. The smaller the size of the saved 
portion of the damaged hologram, the less the light diffracted 
by it in restoring the image. The luminance is correspondingly 
reduced and the sharpness of the holographic image of the 
object is impaired. Thus, a hologram has essential advantages 
with respect to reliability of storage of information over a con- 
ventional photograph or photographic negative, each element of 
which contains information only about the portion of the object 
depicted on it. 

Holographic recording of information is distinguished for its 
high capacity and compact form. A great many holograms can 
be recorded on a single photographic plate. For instance, it is 
sufficient, for this purpose, to record each hologram with 
a different value of the angle of incidence of the reference wave 
on the photographic plate. 

33.6.4 Holography can be used to obtain coloured three-dimen- 
sional images of objects. To make such a “coloured” hologram, 
use is made of the monochromatic light of three lasers with 
three primary colours (for instance, red, green and blue). 
The recording of the interference patterns, corresponding to 
light of three wavelengths, is accomplished simultaneously or 
consecutively on the same photographic plate. To reconstruct 
the coloured three-dimensional image of the object it is neces- 
sary to simultaneously direct onto the hologram the three refer- 
ence beams of monochromatic light that were used in recording. 
33.6.5 Three-dimensional holograms have special properties not 
possessed by the ordinary kind. They are obtained by using 
thick-layer photographic emulsions. Recorded on a three- 
dimensional hologram is a spatial interference pattern, instead 
of a plane one. This pattern is obtained in superposing the sig- 
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nal and reference waves. Such a hologram is similar to a spatial 
diffraction grating (lattice). It is capable of separating out of the 
incident white light, monochromatic light of the wavelength 
lhat was used in recording the hologram. Therefore, an image 
recorded as a three-dimensional hologram can be reconstructed 
by illuminating the hologram either by the corresponding mono- 
chromatic light or by white light. If a three-dimensional holo- 
gram is “coloured”, the coloured three-dimensional image can 
be reconstructed by simply illuminating the hologram with 
white light. 


CHAPTER 34 ABSORPTION, SCATTERING 
AND DISPERSION OF LIGHT. 
VAVILOV-CHERENKOV RADIATION 


34.1 Interaction of Light With Matter 


34.1.1 According to the concepts of classical electron theory, 
the variable clectromagnetic field of a light wave, propagating 
in a dielectric medium, sets up forced vibrations of the bound 
charges (electrons and ions) included in the composition of the 
molecules of the medium. Consequently, each molecule of the 
medium can be considered as a system of oscillators with dif- 
ferent angular frequencies of natural vibration (28.1.1). Ions are 
considerably more massive than electrons and execute appreci- 
able vibration only as the effect of low-frequency (infrared) 
radiation. In the frequency region of visible and ultraviolet 
light, the deciding role is played by the forced vibration of the 
external, more weakly bound electrons of atoms and molecules. 
They are called optical electrons. 

34.1.2 Electrons and ions that execute forced vibrations by 
the action of light radiate secondary light waves of the same 
frequency. The average distance between the molecules of a me- 
dium is only a small fraction of the coherence length (32.1.4) 
of the light. Therefore, the secondary waves radiated by a great 
number of neighbouring molecules are coherent and display 
interference when superposed. 


31--0186 
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If the medium is homogeneous and isotropic (30.1.6), interfer- 
ence results in the formation of a transmitted wave, whose phase 
velocity depends upon the frequency and whose direction of 
propagation coincides with that of the primary wave. 
34.1.3 When the medium is optically nonhomogeneous, the 
superposition of the primary and secondary waves results in 
scattering of the light (34.3.4). Finally, when light falls on the 
interface between two different media, interference results in 
the formation of a reflected wave in addition to the transmitted 
wave. Thus, a more or less considerable layer of particles of 
the medium, adjoining the reflecting surface, participates in 
the formation of the reflected wave. Therefore, upon total 
internal reflection (31.5.8), the electromagnetic field of the 
light wave is not completely cut off at an interface with an opti- 
cally less dense medium, but partly penetrates it. The field 
strength E, however, decreases very rapidly as the distance to 
the interface increases according to the equation 


E oc exp -5 V (sin?i/ng,) —1 | ; 
2 


where z is the distance from the interface, i is the angle of inci- 
dence (i > icr) Ag is the wavelength of light in the medium, 
ier is the critical angle (31.5.8), and ng, is the relative refractive 
index of the medium. 


34.2 Absorption of Light 


34.2.1 The absorption of light refers to the reduction in the 
energy of a light wave as it is propagated in a substance. It 
results from the conversion of energy of the wave into internal 
energy (9.1.2) of the substance or into energy of secondary radi- 
ation having a different spectral composition and other direc- 
tions of propagation (photoluminescence, Sect. 39.7.5). Light 
absorption may lead to heating of the substance, excitation and 
ionization ofits atoms or molecules, photochemical reactions and 
other processes in the substance. 

The absorption of light is described by the Bouguer-Lamberi law 
(also called the Lambert law), which states that the intensity J 
of a plane wave of monochromatic light decreases as it passes 
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through an absorbing medium according to the exponential 
relation 


[=I pe, 


where J, and J are the intensities of the light at the entrance to 
and exit from a layer of the medium of thickness z, and @ is the 
absorption factor (linear coefficient of absorption), which depends 
on the chemical nature and state of the substance and on the 
wavelength A of the light. 

Beer’s law is valid for dilute solutions of an absorbing substance 
in a nonabsorbing solvent: a = bc, where c is the concentration 
of the solution and b is a proportionality factor that is indepen- 
dent of c. Beer’s law does not hold for high concentrations be- 
cause of the effect of the interaction between the closely spaced 
molecules of the absorbing substance. 

34.2.2 According to the Bouguer-Lambert law, the equation of 
a plane, linearly polarized, monochromatic light wave, prop- 
agating in an absorbing medium along the positive direction 
of the OX axis, is of the form: 


E = Ege” ax/2 cos (wt — kz). 


Ifere Æ is the strength of the wave’s electric field at points with 
the coordinate z, Eo is the amplitude of E at points in the plane 
x = 0, œ is the angular frequency of the light, k = 2n/A = 
-= wn/c is the wavenumber, A is the wavelength of the light in 
the medium, c is the velocity of light in free space, and n is 
the refractive index of the medium. 

In the exponential form the equation of this wave is of the form 
(30.2.7): 


Ta B O et OTR] a Pe LERO 


where ñ=n (4 — iF) = n(1—ix) is the complex refractive 
index of the medium (i = yV —1), and x = a/2k = ad/4n is 
the absorptivity, or absorptive power, which characterizes the 
decrease in the intensity and amplitude of a plane wave as it 
propagates in a medium. 

34.2.3 The dependence of the absorption factor œ of a dielectric 
on the light’s wavelength à, characterizing the light absorption 


sik 
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spectrum in this medium, is related to the resonance phenomenon 
upon forced vibration of the electrons in the atoms and the 
atoms in the molecules of the dielectric. Dielectrics absorb light 
more or less selectively: absorption is great only in frequency 
regions close to the natural frequencies of vibrations of the 
electrons in the atoms and the atoms in the molecules. This 
phenomenon of resonance absorption of light is displayed most 
clearly in dilute monatomic gases (for example, the vapours of 
most metals), which have a line spectrum of light absorption. 
Discrete frequencies of intensive light absorption coincide with 
the frequencies of the self-radiation of excited atoms in these 
gases. 

Systems of closely spaced lines, forming absorption bands, are 
observed in gases having polyatomic molecules. The structure 
of the absorption bands is determined by the composition and 
structure of the molecules. Liquid and solid dielectrics have 
continuous absorption spectra, consisting of comparatively wide 
absorption bands, within which the absorption factor a reaches 
a quite high value and varies smoothly in accordance with the 
wavelength à. This kind of dependence of a on A in condensed 
media is due to the strong interaction between the particles of 
the medium, leading to the appearance ofa great number of addi- 
tional resonance frequencies. | 
34.2.4 At sufficiently high intensities of light, deviations from 
the Bouguer-Lambert law (34.2.1) are observed: the absorption 
factor of a dielectric medium begins to depend upon J, decreas- 
ing with an increase in 7. This phenomenon, inexplicable within 
the framework of classical light absorption theory, is readily 
interpreted in the quantum theory of the interaction of light 
with matter. In the absorption of light, a part of the molecules 
of the medium goes over into an excited state. The excited mole- 
cules cannot participate in further light absorption until they 
return, after expending their surplus energy, to the unexcited 
(“normal”) state. The higher the intensity of the light and the 
longer the average lifetime (t) of a molecule in the excited 
state, the greater the fraction of excited molecules in the medium. 
If this fraction of excited molecules is negligible, light is ab- 
sorbed in accordance with the Bouguer-Lambert law. Otherwise, 
a decreases with an increase in the intensity of the light. 

It is feasible to obtain a nonequilibrium state of the medium, 
in which the fraction of excited molecules is so large that the 
absorption factor of the medium becomes negative. This phenom- 
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enon is made use of ina quantum oscillator of radio waves and 
light (39.8.8). 

34.2.5 Metals in a condensed state contain a huge amount of 
conduction electrons and, therefore, have high electrical con- 
ductance. When subjected to light, the conduction electrons 
execute variable motion and radiate secondary waves. As 
a result of the superposition of the primary wave, falling on the 
surface of the metal, and the secondary wave, an intensive reflect- 
ed wave is formed and a relatively weak wave that enters the 
metal. The reflection coefficient (31.5.6) can reach 95 per cent 
and even more. It depends upon the surface finish of the metal, 
its electrical conductance and the light frequency. The refracted 
wave is rapidly absorbed in the metal. Its energy is consumed 
on the production of Joule heat, evolved by the conduction 
currents initiated by the action of light in the thin layer of 
metal adjacent to the surface. In the frequency region of infrared 
radiation, the optical properties of metals are determined main- 
ly by the conduction electrons. But in the region of visible 
light and especially ultraviolet radiation, bound electrons, in 
the ions of the metal, begin to play an appreciable role. This 
leads to a reduction in the reflection coefficient and to its pro- 
nounced dependence on the frequency. 


34.3 Scattering of Light 


34.3.1 The scattering of light is the phenomenon in which light 
is transformed by a substance. It is accompanied by a change 
in the direction of light propagation, and is manifested as an 
extrinsic glow of the substance. This glow is due to the forced 
vibration of the electrons in the atoms of the scattering medium 
by the action of the incident light. Light scattering occurs in 
an optically nonhomogeneous medium, whose refractive index 
varies irregularly from point to point due either to fluctuations 
in the density of the medium or to the presence in it of small 
foreign particles. In the former case, the light scattering is said 
to be molecular or Rayleigh scattering; in the second, light scat- 
tering in a turbid medium. Examples of turbid media are aerosols 
(smoke and fog), emulsions and colloidal solutions. 

34.3.2 The scattering of light in turbid media by particles whose 
size is small compared to the wavelength i, is called the Tyndal 
effect. A system of electrons, executing forced vibration in the 
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N, 


atoms of electrically isotropic particles of small size r, ~ 
zæ (0.1 to 0.2) A, is equivalent to one oscillating electrical dipole 
(linear harmonic oscillator). This dipole oscillates at the frequen- 
cy v of the light incident on it, and the intensity of the light it 
radiates is proportional to v* (31.3.3). Hence, the Rayleigh law 
is valid for scattered light. It states that the intensity J of scat- 
tered light is inversely proportional to the fourth power of the 
wavelength: Iœxà—. When white light passes through a finely 
divided turbid medium, short-wave (dark-to-light blue) light 
predominates in the scattered light, and long-wave (yellow-to- 
red) light in the transmitted light. This explains, for instance, 
the light-blue colour of the sky and the yellowish-red colour of 
the rising and setting sun. 

In scattering of natural light (34.1.1), the dependence of the 
ee a of the scattered light on the scattering angle 0 is of 
the form 


Ig = I n/a (4 + cos?0). 


Here Ig and J,,/, are the intensities of light scattered at angles 
of 0 and 7/2 to the direction of the primary light beam falling 
on a turbid medium. Light scattered at the arbitrary angle 6 is 
partly polarized (35.4.1); that scattered at the angle m/2 is 
completely linearly polarized (31.1.7): vector E of the field set 
up by this light is perpendicular to the plane passing through 
the incident and scattered rays. 

34.3.3 As the size r, of the inhomogeneities in a turbid medium 
is increased, the laws of light scattering are changed. At r, > A, 
the dependence of Ta on 9 is of complex form, the intensity of 
the forward scattering of light (in the directions 0 < 1/2) 
being greater than for backward scattering. This phenomenon is 
called the Mie effect. In this case, light scattered at the angle 
0 = n/2 is only partly polarized. The dependence of the scat- 
tered light intensity J on the wavelength À is of the form: [&%)-», 
where p < 4 and decreases with an increase in ry. At ry >A, 
the spectral compositions of the scattered and incident light 
practically coincide. This explains, for instance, the white col- 
our of the clouds. 

34.3.4 Molecular scattering in pure media, not containing any 
foreign impurities, is due to inhomogeneitics that occur in the 
process of random thermal motion of the particles of the medium. 
These inhomogeneities are associated with fluctuations in densi- 
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ty (14.6.1), whereas in media having anisotropic (polar) mole- 
cules, the inhomogeneities are also due to fluctuations in the 
orientation of these molecules (fluctuations of anisotropy). In 
true solutions, light may be scattered by fluctuations in concen- 
tration. Under ordinary conditions the size of the regions of the 
medium, corresponding to any appreciable fluctuations, is much 
less than the wavelength of visible light. Therefore, the depend- 
ence of the scattered light intensity on the wavelength A and 
angle 0O, as well as on the kind of polarization in molecular 
scattering, is analogous to the corresponding laws for the Tyn- 
dal effect. But, in contrast to the Tyndal effect, the intensity 
of molecular scattering of light depends upon the temperature of 
the medium, increasing with the temperature. 


34.4 Normal and Anomalous Light Dispersion 


34.4.1 The dispersion of light is the dependence of its phase 
velocity v in a medium on its frequency v. According to Sect. 
31.5.1, v = c/n, where c is the velocity of light in free space, 


a 


White 
Red 
Violet 


Fig. 34.1 


and n is the refractive index of the medium. Since ¢ is a univer- 
sal constant, the same for electromagnetic waves of any frequen- 
cy, the existence of the dispersion of light in a medium is due to 
the fact that its refractive index n depends upon the frequen- 

cy v. This dependence is readily detected, for example, in pas- 
sing a beam of white light through a prism made of some trans- 
parent medium. Observed on the screen, mounted behind the 
prism, is an iridescent band (Fig. 34.1) called a prismatic, or 
dispersion, spectrum. 
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34.4.2 The dependence of the refractive index n of a medium on 
the light frequency v is neither linear nor monotonic. The region 
of v values in which dn/dv > 0, i.e. n increases with v, corre- 
sponds to normal dispersion of light. Normal dispersion is found 
for substances transparent to light. For example, ordinary glass 
is transparent to visible light and, in this frequency region, nor- 
mal dispersion of light is observed in glass. A case of normal 
dispersion of light is illustrated in Fig. 34.1. 

The dispersion of light is said to be anomalous if dn/dv < 0, 
i.e. the refractive index of the medium decreases with an in- 
crease in v. Anomalous dispersion is found in frequency regions 
that correspond to intensive absorption of light in the given 
medium (34.2.1). In ordinary glass, for instance, these bands are 
in the infrared and ultraviolet parts of the spectrum. 

34.4.3 Depending upon the kind of dispersion light undergoes, 
its group velocity u can be either more or less than its phase 
velocity v. According to Sect. 30.4.3, the group velocity is 
related to the angular velocity œ of the wave and to its wave- 
number k by the equation: u = dw/dk. Since œ = 2nv and 
k = 2n/X = 2nnv/e, 


N c B v 
_ dn v dn ° 
MeN rs on dv 


In normal dispersion the group velocity is less than the phase 
velocity (u < v). In the case of anomalous dispersion u, > v 
and, in particular, if n + v dn/dv <1, then u >. This does 
not contradict the statement of the special theory of relativity 
that the velocity of any signal (including a light signal) cannot 
exceed c (5.1.3). The concept of group velocity correctly de- 
scribes only a signal whose “shape”, i.e. the distribution of ampli- 
tudes and energies along its “length”, does not change when the 
signal travels through the medium. But for light, this condition 
is complied with only approximately; the narrower the frequency 
spectrum of the signal and the less the light dispersion in the 
medium, the more exactly the condition is complied with. In 
the frequency regions corresponding to anomalous dispersion, 
the group velocity does not coincide with the velocity of the 
se because, owing to considerable dispersion of light, the 
“shape” of the signal changes rapidly as it propagates in the 
medium. 
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34.5 Classical Electron Theory of Light Dispersion 


34.5.1 Optically transparent media are nonmagnetic (ur ~ 1), 


so that their refractive index (31.5.1) n = V er = V 1+ Nas 
where ¢, and y, are the relative permittivity (dielectric con- 
stant) and the dielectric (or electric) susceptibility of the medium 
(18.2.4). Therefore, the dispersion of light can be regarded 
as the dependence of e, and Xe on the frequency of the variable 
electromagnetic field of the light, causing electronic polari- 
zation of the medium (48.2.2). If each atom (or molecule) of the 
medium contains one optical electron (34.1.1), the polarization 
vector of the medium (18.2.3) P, = —enor, where —e is the 
charge of the electron, r is its displacement from the equilibrium 
position, and n, is the concentration of atoms (or molecules) 
of the medium. On the other hand (18.2.4), Pe = £oXeE, where 
€ is the electric constant (14.2.7), and E is the strength of the 
electric field of the light. 

34.5.2 An optical electron executes forced vibrations under the 
effect of the following forces: 

(a) restoring quasi-elastic force (40.3.5) F est = — mgr, where 
m and œ are the mass of the electron and the angular fre- 
quency of its free undamped vibrations; 

(b) force of resistance F es; = —2Bm dr/dt, where Ñ is the damp- 
ing factor of free vibrations of the electron; 

(c) driving force F = —eE, exerted on the electron by a vari- 
able field of strength E. 

The equation of forced vibrations is 


d?r de gl 
ae 1 Ba tor — 


In the case of linearly polarized monochromatic light with the 
angular frequency œ, the field strength E = E, cos wt, where 
E, = const is the amplitude vector. If, in addition, the medium 
does not absorb light, then P =0 and the steady-state forced 
eee of the optical electron is executed according to the 
relation 


eE 


r= — Ąą 
m (0—0?) * 
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Here the polarization vector of the medium is 


ne E 


m (0 — o?) 


nge? 


P. = hee E 
: Eom (0—0?) 


and Ye= 


The dependence of the refractive index of the medium on © is 
of the form 


nge? 


E E E E 
+ Eom (wz — o?) 


34.5.3 At values of œ close to œ, the absorption of light in the 
medium can no longer be neglected, assuming that $ = 0. In an 
absorbing medium (i.e. at B = 0), vibrations of the optical 
electron and of vector P, are out of phase with respect to the 
oscillations of the field strength E (29.2.2): 


r = A cos (wt + o), 


where 

A = — serea 2.) Seeeee ree and tan poen a 
m V (0—0) + 40 RIS 

Accordingly 

Pe = ngoe? Eg COS (wt + Po) 


mV o T ao 


Introduced to describe the properties of a light-absorbing medi- 
um, along with the complex refractive index (34.2.2) ñ = 


= n (1 — ix), are the complex dielectric susceptibility Xe and the 


complex dielectric constant &,: 


and er=14 3e, whereas n2=1+%e. 
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Here P,and E are complex values of the polarization and 
strength of the field: 


P n E= E iot i 
m V (0—02) + 4P20? 

so that 

~ n e2 . 

n? = n? (1— ix)? = 1 4- ——_____"“________.. tto 
Eom V (o — W2)? + 4B2w ' 

2 
n? (1—x3)=1+ Nye” COS Po 


Eom V (wZ— 02)? + 4B20? 
ne? (wz — o?) 
Sitom [o o FAT 


nge? sin Po 2n e2Bw 


eom V (w2— 2)? + 4B2w? eom [(0} — 0)? +- 48202] ° 


2n2% = — 


34.5.4 In the classical electron theory of light dispersion in 
gases, each molecule of the gas is dealt with as a system of q 
linear oscillators. If m); and Bj; are the natural angular fre- 
quency and damping factor of the jth oscillator, then 


T (0—0) Í; 
nae ) J 

n? (1 — x?) = 4 -|- —— >; ze e E 
Eom = (w co)? + 4B; (2 


and 
q 
nae? w B ify 
rn = ee S 


O.. ANG 1, Ree 
= (o j; — 07)? + 4B. 


The dimensionless coefficient f; characterizes the contribution of 
the jth oscillator to the dispersion and absorption of light. It 
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is called the oscillator strength. In the classical theory of disper- 
sion, the values of œp; and f; are assumed to be known from 
experimental investigations. 


Fig. 34.2 


For gases, x & 1 and z differs only slightly from unity, so that 
n? — 1 = (n + 1) (n — 1) x 2 (n — 1). Therefore, the depen- 
dence of n on w is of the form 


q 2 
tea Soati a 
= 2& om (wp ;-- ©” )? + 4B 20" e 


=1 


The curve of this equation is shown in Fig. 34.2. 
Anomalous dispersion is observed close to each of the frequen- 


cies Wo;. 


34.6 Vavilov-Cherenkov Radiation 


34.6.1 Vavilov-Cherenkov radiation (or effect) is the radiation of 
light, differing from luminescence (39.7.1), emitted in the mo- 
tion of charged particles in a substance at velocities V greater 
than the phase velocity v of light in this substance. The condi- 
tion for the existence of this radiation is: c/n < V < c, where c 
is the velocity of light in free space (or in vacuum), and n > 1 
is the refractive index of the substance. 


34.6] VAVILOV-CHERENKOV RADIATION 493 


In the process of Vavilov-Cherenkov radiation, the energy and 
velocity of the free emitting particle is reduced, i.e. the particle 
Is retarded. But, in contrast to braking radiation (bremsstrah- 
iung) of slowly moving charged particles (31.3.4), resulting from 
their change in velocity, the reduction in the velocity of particles 
in Vavilov-Cherenkov radiation is itself a consequence of this 
radiation. In other words, if the loss of energy of the particle, 
consumed in emitting Vavilov-Cherenkov radiation, could be 
replenished in some way and the particle would continue travel- 
ling in the substance at a “faster-than-light” velocity (V > v), 
Vavilov-Cherenkov radiation would still be observed, whereas 
there would be no braking radiation in this case. 

34.6.2 A charged particle gives rise to short-duration polariza- 
lion (18.2.2) of the substance in the vicinity of the points 
through which it travels. Hence, the molecules of the medium, 
lying on the path of the particle, become temporarily acting 
coherent sources (30.5.1) of elementary electromagnetic waves 
that interfere upon superposition. 

If V < v= c/n, the elementary waves cancel one another. 
Assuming that the charged particle travels at the velocity V 
(where V < v) along the OX axis (Fig. 34.3) and is at points 
A and B at the instants of time ¢ and ¢t + At, respectively. The 
distance between A and B is 1 = VAt. The path difference of 
elementary waves, radiated from points A and B in the arbi- 
trary direction n, making the angle a with vector V, is 


A= DF =(v—Vcos2) At=I1 (+-—cos a | : 

It is possible, for each value of the wavelength A, to find a value 
l = 1,5 at which A = 4/2, so that the elementary waves cancel 
one another. Thus 


À 


lar = 2 T 
2 (F — 00s a | 


When / = laa, radiation, in the direction n from any point M 
of portion AB of the path of the charged particle, is cancelled 
upon interference with radiation in the same direction from 


a like point N on the adjacent portion BC = AB = la}, the 
distance between points M and N being MN = 1,,. Therefore, 
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upon motion of a charged particle at uniform velocity in a 
straight line in a substance at a “slower-than-light” velocity, 
the particle docs not emit radiation. 

34.6.3 If a particle travels in a substance at a “faster-than- 
light” velocity V > v = c/n, the value of 1,,, complying with 
the condition for the cancelling of elementary waves, 


A 


2 COS & 
y f 


can be found for all values of angle œ, with the exception of 


lan = 
2 


0 =are cos — = arc cos — 

E Vo © nV `’ 
For the direction 0 = a, the path difference of the elementary 
waves emitted by any two points A and B of the path of the 
charged particle (Fig. 34.3) equals zero. Thus 


A = DF = (v — V cos0) At = 0. 


Consequently, elementary waves, propagating in the direction 
a = Q, reinforce one another upon interference, forming a resul- 


Fig. 34.3 Fig. 34.4 


tant radiation in this direction. This is Vavilov-Cherenkov 
radiation. Light, emitted at each small portion of the path of 
a charged particle, is propagated along the elements of a cone, 
whose vertex O (Fig. 34.4) is located on this portion and whose 
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axis coincides with the path of the particle. The elements make 
the angle 8 = arc cos (c/nV) with the axis. The light is pola- 
rized so that vector E is normal to the surface of the cone, and 
vector II is tangent to it. 


CHAPTER 35 POLARIZATION OF LIGHT 


35.1 Polarization of Light in Reflection and Refraction 
at the Interface Between Two Dielectric Media 


35.1.4 Light emitted by ordinary (nonlaser) sources is made up 
of a set of a great many plane polarized wave trains, or pack- 
ets (22.1.3), whose electric vectors E vary along all possible 
directions perpendicular to the ray (30.2.1). Light is said to be 
natural, or unpolarized, if not one of the above-indicated direc- 
tions of vibration is predominant. In natural light, the resul- 
tant strength E oscillates at each point of the field in a direction 
a rapidly and randomly varies in a plane perpendicular to 
the ray. 

Light is said to be partially polarized if in it there is a predomi- 
nant direction of oscillation of vector E. Partially polarized 
light can be regarded as a collection (“mixture”) of natural and 
linearly polarized light, simultaneously propagating in a single 
direction. 

39.1.2 The polarization of light is the separation of linearly 
polarized light out of natural or partially polarized light. Special 
devices, called polarizers, are used for this purpose. Their prin- 
ciple of operation is based on the polarization of light upon 
being reflected or refracted by the interface between two dielec- 
tric media, as well as on the phenomenon of birefringence (35.2.1) 
and dichroism (35.2.10). The same devices can be utilized as 
analyzers, i.e. devices to determine the kind and degree of 
polarization of light. 

Assume that linearly polarized light, whose electric vector E,, is 
along the line p-p and oscillates with the amplitude Ap, falls 
on the analyzer perpendicularly to the plane of Fig. 35.1. As- 
sume further that the electric vector Ea of the light passed by the 
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analyzer is along line a-a, which makes the angle a with line 
p-p. The incident light can be represented in the form of two 
waves, linearly polarized in mutually perpendicular planes 
(31.1.8). The wave whose electric vector E, oscillates along 
the direction perpendicular to a-a, with the amplitude 4, = 
Ap sin a, cannot pass through the analyzer. But the other wave, 
whose electric vector E, oscillates along the direction of a-a, 
with the amplitude A, = Ap cosa 
passes completely through the analyz- 
er. Consequently, the amplitude of 
the light emerging from the ana- 
lyzer is 


Aa a As = Ay COS œŒ. 
The intensities J, and Jp of the lin- 
early polarized light incident on and 


emerging from the analyzer, respec- 
tively, are related by the Malus law: 


Ia= Ip cosa. 


The principal plane of a polarizer 
(or analyzer) is the plane of polari- 
zation [plane of oscillation, according to previous terminology 
(31.1.7)] of light passed through the polarizer (or analyzer). 
35.1.3 In investigating the laws of light polarization as a result 
of reflection and refraction of natural light, the light can be 
conveniently dealt with as a combination of linearly polarized 
waves, of equal intensities, of two types: s- and p-waves (31.5.4). 
The reflection coefficient (31.5.6) of the s-waves (Rs) is always 
greater than that (Rp) of the p-waves. Therefore, in contrast to 
incident natural light, reflected and transmitted (refracted) 
light is partially polarized. Oscillations of the electric field vec- 
tor E of the s-type (perpendicular to the plane of incidence) pre- 
dominate in reflected light, whereas oscillations of the p-type (in 
the plane of incidence) predominate in transmitted light. 
Brewsier’s law states that reflected light is completely linearly 
polarized at the angle of incidence i = igp, complying with the 
condition tan ip, = ng, where na is the relative refractive 
index of a medium reflecting light. 

The angle ig, is called Brewster's angle. When i = ip,, the 
reflected and refracted rays are perpendicular to cach other and 
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the reflection coefficient of the p-wave is Rp=0 (34.5.6). 
Ilence, only s-type waves are reflected. But their reflection 
coefficient is considerably less than unity (about 0.15 for glass), 
‘Thus, the transmitted light is only partially polarized. 
35.4.4 Polarization of transmitted light can be enhanced by 
subjecting the light to a number of consecutive reflections and 
refractions. This is done in a stack consisting of several identi- 
cal and parallel plates of some transparent dielectric (for instance, 
glass), set at the Brewster angle to the incident light beam. 
If the number of plates in the stack is sufficiently large, then 
thelight passing through the stack is found to be practically 
linearly polarized (p-type). In the absence of light absorption 
in the stack, the intensities J, and Ip of the reflected and trans- 
mitted linearly polarized light are the same and equal to one- 
half of the intensity 7, of the incident natural light. Thus 


1 


is=Ip=z lo. 


35.1.5 According to the conceptions of classical electron theory, 
the formation of the reflected wave is due to the secondary waves 
emitted by the molecules, the 

oscillators of the light reflecting | 
medium (34.4.3). Oscillators : l 
(vibrating electric dipoles), lsr 
whose axis is perpendicular to the | 
plane of incidence, correspond 

to a wave of the s-type. These j 
oscillators are shown in Fig. 35.2 l 2/2 

by black dots on the refracted | | 

and reflected rays. As is evident | 

from the polar directional dia- | 

gram (Fig. 31.4), such oscillators Fig. 35.2 

should radiate intensively in a 

all directions lying in the plane 

of incidence, i.e. they should participate in the formation of 
both the refleeted and the refracted s-waves. 

Oscillators, whose axis lies in the plane of incidence and is 
perpendicular to the refracted ray (shown in Fig. 35.2 by trans- 
verse dashes), correspond to a wave of the p-type. Oscillators do 
not radiate along their axis (Fig. 31.4), and at i = ip, the 
reflected ray is perpendicular to the refracted one, thereby being 
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parallel to the axes of these oscillators. Hence, at i = ip, 
these oscillators do not radiate in the direction of the reflected 
ray, making no cantribution to the reflected wave. Consequently, 
the reflected light is completely linearly polarized (s-type 
waves). 


35.2 Birefringence (Double Refraction} 


35.2.1 Most crystals are optically anisotropic (30.1.6). Their 
relative dielectric permittivity (dielectric constant) and refrac- 
tive index depend on the direction of the electric vector E of the 
light wave. Observed in optically anisotropic crystals is the 
phenomenon of double refraction, or birefringence, which consists 
in the splitting of a light ray, falling on a surface of the crystal, 
into two refracted rays. The birefringence of light in a crystal 
of Iceland spar (CaCQ ,) is illustrated in Fig. 35.3. 

35.2.2 The optic axis of a crystal is the direction in an optically 
anisotropic crystal along which light propagates without under- 


a=101°52! B=78° 08" 


Fig. 35.3 Fig. 35.4 


going birefringence. It is necessary to underline that the optic 
axis of a crystal is no single special straight line similar, for 
instance, to the axis of symmetry of a body. It only character- 
izes a certain direction in the crystal and can be passed through 
any point of the crystal. 

Depending upon their type of symmetry, optically anisotropic 
crystals can be either uniazial or biazial, i.e. they can have ei- 
ther one or two optic axes. An example of a uniaxial crystal is 
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Iceland spar, whose optic axis coincides in direction with the 
diagonal Af,N, of the crystal (Fig. 35.3). 

The principal plane (or principal section) of a uniaxial crystal 
for a certain ray is the plane passing through this ray and inter- 
secting the optic axis. 

35.2.3 In a uniaxial crystal, one of the rays formed in birefrin- 
rence (double refraction) obeys the ordinary laws of light refrac- 
tion (31.5.3). It lies in the plane of incidence and complies with 
Snell’s law. It is therefore called the ordinary, or O ray. The 
secondray is called the extraordinary or E ray because it general- 
ly does not lie in the plane of incidence and does not obey 
Snell’s law. For example, even in the case of normal incidence 
of light on a plate cut from a uniaxial crystal, the extraordinary 
ray is refracted (Fig. 35.4). Its refractive angle re depends on 
low the surface of the plate is oriented with respect to the optic 
axis of the crystal. It may be equal to zero only in two cases: 
(a) if the surface of the plate is perpendicular to the optic axis 
(light propagates in the plate along the optic axis and is 
thereby not subject to birefringence); (b) if the surface of the 
plate is parallel to the optic axis (light propagates in the plate 
perpendicular to the optic axis). 

In a biaxial crystal both refracted rays behave like extraordi- 
nary ones. 

35.2.4 Birefringence indicates that a light wave incident on 
an optically anisotropic crystal o two waves, propagat- 
ing through the crystal, generally in different directions. In 
a uniaxial crystal, they are called the ordinary and extraordinary 
waves. The ordinary and extraordinary rays indicate the 
directions of the Umov-Poynting vectors (31.2.3) of the corre- 
sponding waves in the crystal, i.e. the direction of energy trans- 
fer by these waves. 

The ordinary and extraordinary waves are linearly polarized 
(31.1.7).* In the ordinary wave, vector E is perpendicular to 
the principal plane of the crystal for the ordinary ray. The 
clectric vector E of the extraordinary wave lies in the principal 
plane of the crystal for the extraordinary ray. The directions 
of vectors E in the ordinary and extraordinary waves are con- 
ditionally shown in Fig. 35.4 by black dots on the ordinary 


* Frequently, mention is made of linear polarization of the 
ordinary and extraordinary rays, actually meaning the polariza- 
tion of the waves corresponding to them. 
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ray and transverse dashes on the extraordinary ray (it is as- 
sumed that both rays, and optic axis MN intersecting them, lie 
in the plane of the figure). 

35.2.5 The ray velocity of the wave, or simply ray velocity, in an 
optically anisotropic crystal is the velocity v of energy transfer 
by the wave. In a uniaxial crystal, the velocity v, of the ordinary 
ray is numerically equal in all directions: vo = c/n, where c is 
the velocity of light in free space (or in vacuum), and n, = const 
is the refractive index of the crystal for the ordinary ray. According- 
ly, the velocity ve of the extraordinary ray is numerically equal 
to ve = c/ne, where ne is the refractive index of the crystal for 
the extraordinary ray. The values of ne and ve depend on the 
direetion of the extraordinary ray with respect to the optic axis 
of the crystal. For a ray propagating along the optic axis, 
Ne = No and ve = vo. The value of ne differs to the greatest 
extent from that of nọ for a direction perpendicular to the optic 
axis: Ne = nep 

35.2.6 The ray surface of a wave in acrystal is the locus of the 
heads of vectors v of the ray velocity of the wave that extend 
from a certain point O in the crystal in all possible directions. 


„AA, 
Na 
[a 
Fig. 35.5 


In a uniaxial crystal, the ray surface of the ordinary wave has 
the shape of a sphere, whereas that of the extraordinary wave is 
an ellipsoid of revolution about the optic axis MN passing 
through point O. The ellipsoid and sphere contact each other 
at the points of intersection with optic axis MN. When ne> no, 
the ellipsoid is inscribed in the sphere (Fig. 35.5a), whereas if 
Ne<n, the ellipsoid is circumscribed about the sphere 
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(iig. 35.55). In the former case, the uniaxial crystal is said to 
be optically positive; in the latter, optically negative. 

35.2.7 Birefringence in a uniaxial crystal can be explained 
and the directions of the ordinary and extraordinary rays can be 
found by using the Huygens graphical method. Assume that 
n plane unpolarized light wave is incident on the plane sur- 
face ab of a uniaxial optically negative crystal (ora plate cut out 
of this crystal) at the angle i (Fig. 35.6). The optic axis MN 
of the crystal, passing through point A of surface ab, lies in 
(he plane of the figure and makes the angle y with plane ab. 


Fig. 35.6 


At the instant ¢ of time being considered, the front AD of the 
incident wave has reached point*A on the surface of the crystal, 
und this point becomes the source of two linearly polarized 
elementary secondary waves, ordinary and extraordinary, in the 
crystal. At the instant of time t+ At, where At is the time 


required for the incident light to travel the distance DK, the 
disturbance, propagating from point A in the form of the ordi- 
nary elementary wave, reaches the points of a sphere of radius 
r At having its centre at point A. The disturbance, propagating 
Irom point A in the form of the extraordinary elementary wave, 
reaches points at the same time on the surface of the ellipsoid 
of revolution that contacts the sphere of radius v,At at point LZ 
of its intersection with optic axis MN. This ellipsoid is geomet- 
rically similar to the ray surface of the extraordinary wave in 


t 


the crystal (35.2.5). 
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Planes KC, and KCe, perpendicular to the plane of the figure 
and tangent, respectively, to the sphere and the ellipsoid, 
indicate, according to the Huygens principle (33.1.1), the 
positions, at the instant of time ¢-+ Aż, of the fronts of the 
ordinary and extraordinary waves that actually propagate in 
a uniaxial crystal. Straight lines from point A to the points 
of tangency B and F indicate the directions of the ordinary and 
extraordinary rays. Both rays lie in the plane of incidence, but 
the extraordinary ray is not orthogonal to the wave surface KC ¢. 
The ordinary and extraordinary waves are linearly polarized 
in mutually perpendicular planes. The directions of the electric 
vectors Eo and Ee in the ordinary and extraordinary waves are 
shown in Fig. 35.6 by black dots and transverse dashes located 
on the corresponding rays. 

Note. If the optic axis MN of the crystal does not lie in the plane 
of incidence of the light, then, in general, the extraordinary 
ray also does not lie in the plane of incidence. Consequently, 
the angle between the planes of polarization of the ordinary 
and extraordinary waves differs slightly from a right angle. 
35.2.8. The construction for the ordinary and extraordinary 
rays in the case of normal incidence of light on the surface of 


an optically negative uniaxial crystal is shown in Fig. 35.7. 
Here ab is the position of the incident wavefront at the instant ż 
of time, and CoC, and CC; are the positions of the fronts of the 
ordinary and extraordinary waves in the crystal at the instant 
of time ¢- At. 

It is assumed that the optic axis MN lies in the plane of inci- 
dence and makes the angle y, differing from O and 2/2, with 
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the refracting surface ab. It is evident from Fig. 35.7 that the 
ordinary ray is an extension of the incident one and that the 
extraordinary ray is refracted by the angle r, + 0. 

Considered in Fig. 35.8 is a case in which light is normally 
incident on the plane surface ab of an optically negative uni- 
axial crystal, whose optic 

axis MN is parallel to ab. 


Here, as is evident from g A 

the construction, the extra- 1” N 

ordinary ray is not refract- 

cd at the surface ab and 4% ce 
Bee 5 : ; O 

coincides in direction with ọ j 

the ordinary and incident © Ce 


rays. But the velocities of 

the ordinary and extraor- 0,€ O;,€y 
dinary rays in the crystal 

in this direction differ and Fig. 35.8 

are respectively equal to 

(35.2.5): vg = c/no and ve = 

— c/n). Therefore, the two rays (waves) travel the same 
distance d in the crystal with an optical-path difference 
(32.3.2): i 


As — d (no = Neo): 


35.2.9. The path of the rays in a polarizing prism are shown 
in Fig. 35.9. This prism is cut out of a crystal of Iceland spar 


so that its sides AB and CD are parallel to its optic axis MN. 
Then the prism is cut apart along its diagonal plane AC and 
glued together along these diagonal surfaces by a thin layer of 
an optically isotropic transparent substance called Canada bal- 
sam. The crystal of Iceland spar is uniaxial and optically nega- 
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tive; its refractive indices (35.2.5) are: no = 1.658 and nop= 
== 1.486. The refractive index of Canada balsam is nch = 1.550, 
i.e. Canada balsam is a medium less optically dense than the 
material of the prism for the ordinary ray and more optically 
dense than this material for the extraordinary ray. Light falls 
on the prism normal to its side AB (ray S in Fig. 35.9). The ordi- 
nary and extraordinary rays propagate in the prism without 
being refracted until they reach the layer AC of Canada balsam. 
The dimensions of the prism are selected so that the angle of 
incidence i of the ordinary ray on surface AC is greater than the 
critical angle for total internal reflection (31.5.8). Therefore, 
the ordinary wave is completely reflected by the layer of Canada 
balsam (ray o in Fig. 35.9). The extraordinary wave passes freely 
through the layer of Canada balsam and the second half of the 
polarizing prism. Thus, the polarizing prism can be used as a 
polarizer (35.1.2). 

85.2.10. All birefringent crystals absorb light to one or another 
degree. This absorption is anisotropic: the absorption factor 
(34.2.1) depends upon the orientation of the electric vector of 
the light wave and on the direction of light propagation in the 
crystal, as well as on the wavelength. This phenomenon is 
called dichroism, or pleochroism, because it is manifested in the 
different colouring of the crystals in different directions. An 
example of a strongly dichroic crystal is tourmaline, a uniaxial 
crystal in which the ordinary ray is absorbed much more intense- 
ly than the extraordinary ray. Even more pronounced dich- 
roism is displayed by crystals of herapathite, which is used to 
make thin films, converting natural light into linearly polar- 
ized light and called polaroids. 


35.3 Interference of Polarized Light 


39.3.1. Trains of waves with all possible orientations with re- 
spect to the ray of their planes of polarization, included in the 
composition of natural light, are incoherent because they cor- 
respond to the radiation of various independent atoms of the 
light source. All these trains participate in the formation of the 
ordinary and extraordinary waves that propagate in a uniaxial 
crystal when natural light falls on it. But the contribution of 
each separate train to these two waves is, in general, not the 
same. It is greater to the wave whose plane of polarization makes 
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the smaller angle a with the plane of polarization of the wave 
train. In other words, the ordinary and extraordinary waves 
are generated mainly by different wave trains included in the 
composition of natural light. Consequently, the ordinary and 
extraordinary waves, propagated in a uniaxial crystal on which 
natural light falls, are not coherent. 

35.3.2. The ordinary and extraordinary waves, propagated in 
a uniaxial crystal upon the incidence of linearly polarized light 
[obtained from natural light by means, for example, of a po- 
Jarizing prism (35.2.9) or some other polarizer] are coherent with 
cach other. This is due to the fact that the planes of polarization 
of all the wave trains included in the composition of the inci- 
dent light have the same orientation. 

Assume that a parallel beam of light, after passing through polar- 
izer P (Fig. 35.10), is normally incident on the plane surface ab 


p 


Fig. 35.40 Fig. 35.14 


of plane-parallel plate B, cut out of a uniaxial crystal parallel 
to its optic axis MN (axis MN is parallel to plane ab). Shown 
in Fig. 35.14 is vector A; of the amplitude of the ith wave train. 
This vector is oriented along line p-p, corresponding to the di- 
rection of oscillation of the electric vector in the light emerging 
from the polarizer. The contributions of the ith wave train to 
the ordinary and extraordinary waves are characterized by the 
amplitudes Ajo = A; sing and A;,— A; cosa, whose ratio 
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(Aiol ie = tana is the same for all trains. In particular, 
if a = n/4, then Aio = Aie because pairwise coherent trains, 
polarized in mutually perpendicular planes, have the same in- 
tensity. 

35.3.3. At the entrance to crystal plate B (35.3.2), the electric 
vectors E, and E, of the ordinary and extraordinary waves os- 
cillate in phase and their vector sum is equal to electric vector 
of linearly polarized monochromatic incident light: Ep = 
= Eo + Ee. In the plate the ordinary and extraordinary waves 
propagate at different velocities (35.2.8). Therefore, at the exit 
from the plate (of thickness d), the mutually perpendicular elec- 
tric vectors Ef and Eg of the ordinary and extraordinary waves 


oscillate with a phase difference of 


amAs 2nd 

^p = any ae Oe (no— neo), 

where As is the optical-path difference of these waves (35.2.8), 
and A, is the wavelength of light in vacuum. Consequently, as 
a result of its passing through the plate, the light, in the general 
case, becomes elliptically polarized (31.1.7): the head of vector 
E’ = E; + E; describes an ellipse lying in a plane perpendicu- 
lar to the beam. If œ is the angle between the direction of oscil- 
lation of vector Ep and the optic axis MN of the plate, then am- 
plitudes A, and A, of vectors E, and E, equal: Aj =A p sina 
and A= Áp cosa, where Ap is the amplitude of vector Ep. 


In the absence of light absorption in the plate, the amplitudes of 
vectors E and Ej; are also equal to Ao and Ag. 

39.3.4. In accordance with the thickness d of the plate, several 
special cases are possible. 

(a) A quarter-wave plate is one whose thickness satisfies the rela- 
tion: d (no — neo) = + (m+ 1/4) ào where m = 0, 1, 2, ...3 
the plus sign corresponding to an optically negative crystal, 
and the minus sign to an optically positive one (35.2.6). At the 
exit from such a plate the oscillations of vectors Ef and Ej have 


a phase difference of x/2. If, in addition, a = 7/4, the light 
emerging from the plate is circularly polarized (31.1.7). 

(b) For a half-wave plate: d (no — neo) = + (m + 1/2) Ag. 
At the exit from such a plate, the oscillations of vectors E, 
and Eg have a phase difference of xn. Light emerging from the 
plate remains linearly polarized. But the directions of oscilla- 
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tion of vectors E, and E’ of the incident and transmitted light 
are symmetrical with respect to the principal plane of the plate 
(Fig. 35.12). 

(c) For a full-wave plate: d (no — neg) = mà. As a result of pass- 
ing through the plate, the light remains linearly polarized 
in the same plane as the incident 
light. 

35.3.5. Coherent waves emerging from 
crystal plate B (Fig. 35.10) cannot 
display interference because they are 
polarized in mutually perpendicular 
planes. Therefore, one more polar- 
izing prism, the analyzer A (Fig. 35.13) 
is mounted beyond plate B. The ana- 
lyzer separates out of the incident 
coherent waves, the components po- 
larized in one plane. It thus provides Fie. 35.12 

the conditions necessary for obtain- nee as 

ing the interference of these waves. 

The results of this interference depend upon the phase difference 
Ag acquired by the ordinary and extraordinary waves in the 
plate, on the ratio’*of the amplitudes of these waves and on the 
angle B between the principal planes of the analyzer and po- 
larizer (35.1.2). 

For example, if the angle between the principal plane of the 
polarizer and optic axis MN of the plate a = x/4, the ampli- 
tudes and intensities of the ordinary and extraordinary waves 
are the same. Assume in this case that monochromatic light of 
the wavelength A, in vacuum is incident on the plate. The two 
following limiting cases may occur: 


A ac l oe ES + 2mn 
P SR = 9 4 O 


(where m=0, 1, 2,...). 


In the first case, corresponding to a full-wave plate, the light 
falling on the analyzer is linearly polarized in the principal 
plane of the polarizer. Therefore, at B = 0 (the analyzer is 
mounted parallel to the polarizer), the intensity Z, of the light 
passing through the analyzer is maximal, whereas at B = 7/2 
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(the analyzer and polarizer are crossed), J, = 0, i.e. at B = 0 
an interference maximum is observed, whereas at B = n/2 a 
minimum is observed. 

In the second case, corresponding to a half-wave plate, the light 
falling on the analyzer is linearly polarized in a plane making 
the angle 2a = n/2 with the principal plane of the polarizer. 
Therefore, at B =O an interference minimum is observed, 
whereas at B = x/2 a maximum is observed. 

If linearly polarized white light is incident on plate B (Fig. 35.13), 
then in observing the light through the analyzer, the plate is 


b 


Fig. 35.13 


seen to be coloured. Upon rotating the analyzer about the beam, 
i.e. in varying angle B, the colouring changes. This is due to 
the fact that the value of the phase difference Ag, determining 
the result of interference, depends upon the wavelength of the 
light. A plate, whose thickness d is not the same at various 
places, is seen in white light to be oddly coloured, with each col- 
oured interference line (isochromate) passing through points of 
equal thickness d. A similar pattern is observed in a plate, which 
is of equal thickness at all points, but the value of the difference 
(no — neo) varies. In this case, each isochromate passes through 
points of the plate corresponding to the same value of 
(no — Neo). 


35.4 Artificial Optical Anisotropy 


35.4.1 An optically isotropic transparent body becomes an- 
isotropic if it is subjected to mechanical deformation. This 
phenomenon is sometimes called photoelasticity. Upon lincar 
(uniaxial) tension or compression of an isotropic body along 
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the OX axis, it acquires the optical properties of a uniaxial 
crystal (35.2.2), whose optic axis is parallel to OX. The differ- 
ence between the refractive indices for the ordinary (n,) and 
extraordinary (neo) rays, in the direction perpendicular to axis 
OX, is proportional to the normal stress o (40.3.3). Thus 


No — Ney = ko, 


where k is the proportionality factor depending upon the prop- 
erties of the body. 

35.4.2 The electrooptical Kerr effect is the appearance of optical 
anisotropy in a transparent isotropic solid, liquid or gaseous 
dielectric upon being placed in an external electric field. By 
the action of the uniform electric field, the dielectric is polar- 
ized and acquires the optical properties of a uniaxial crystal 
whose optic axis coincides. in direction with the field strength 
vector E. The difference between the refractive indices of a 
polarized dielectric for extraordinary and ordinary rays of 
monochromatic light, propagating perpendicular to the direc- 
tion of vector E, satisfies the Kerr law: 


neo — No = Bhyk?, 


where A, is the wavelength of the light in vacuum, and B is 
the Kerr constant. The value of B depends upon the nature of the 
substance, wavelength A, and the temperature, decreasing, as 
a rule, as the temperature is raised. The sign of the difference 
(neo — No) is independent of the direction of the field. For the 
majority of substances B > 0, so that they are similar with 
respect to their optical properties in a uniform electric field 
to optically positive uniaxial crystals (35.2.6). 

35.4.3 The Cotton-Mouton effect is the appearance of optical 
anisotropy in certain isotropic substances (liquids, glasses and 
colloids) when they are placed in a strong external magnetic 
field. In a uniform magnetic field, the substance acquires the 
optical properties of a uniaxial crystal, whose optic axis coin- 
cides in direction with the magnetic field strength vector H. The 
difference between the refractive indices of the substance for 
extraordinary and ordinary rays of monochromatic light, prop- 
agating in the direction perpendicular to vector H, is propor- 
tional to H?. Thus 


Neo aaa No = Ch H?, 
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where C is the Cotton-Mouton constant, and àg is the wavelength 
of light in vacuum. The value of constant C depends upon the 
nature of the substance, wavelength A, and the temperature. 


35.5 Rotation of the Plane of Polarization 


35.5.1 As linearly polarized light passes through certain sub- 
stances that are said to be optically active, the plane of polar- 
ization of the light (31.1.7) rotatesabout the direction"of the ray. 
Optical activity is displayed by certain crystals (quartz, cin- 
nabar, etc.), pure liquids and solutions (turpentine, a solution 
of sugar in water, etc.). All substances that are optically active 
in the liquid state, possess the same property in the crystalline 
state as well. But some substances that are optically active in 
the crystalline state, lose this activity in the liquid state. Conse- 
quently, optical activity can be due either to the structure of the 
molecules of the substance themselves or to the arrangement 
of the particles in the crystal lattice. 

30.5.2 In optically active crystals and pure liquids, the angle ọ 
of rotation of the plane of polarization of light is proportional 
to the thickness 1 of the layer of the substance through which 
light passes. Thus ọ = al. The proportionality factor œ is 
called the rotatory power, or specific rotation. The rotatory power 
depends upon the nature of the substance, the temperature and 
the wavelength A, of the light in vacuum. The dependence of 
a on i, is called the rotary, or rotatory, dispersion. Far from the 
band of light absorption by the substance, rotary dispersion 
obeys Biot’s law: a X 1-2. 

35.5.3 The majority of optically active crystals exist in two 
versions. As light passes through a crystal of one version, said 
to be dextrorotatory or right-handed, the plane of polarization 
is rotated to the right, i.e. clockwise (for an observer looking 
toward the oncoming beam). In the passing of light through a 
crystal of the other version, said to be levorotatory or left-handed, 
the plane of polarization is rotated to the left (counterclockwise). 
The values of the rotatory power for the two versions of the same 
optically active crystal differ only in sign. 

39.5.4 The angle of rotation of the plane of polarization of light 
in travelling a path of length Z in an optically active solution is 


© = [a] cl = [a] DKL. 
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Here c is the volume-mass concentration of the optically active 
substance in the solution (in kg/m), D is the density of the 
solution, and K == c/D is the weight-part concentration, i.é. 
the ratio of the mass of the optically active substance to thé 
mass of the whole solution. The proportionality factor [a] is 
called the rotatory power, or specific rotation, of the solution. 
The value of [a] depends on the nature of the optically active 
substance and of the solvent, the wavelength of the light and 
the temperature. 

35.5.5 When subjected to the action of an external magnetic 
field, an optically inactive medium acquires the capacity to 
rotate the plane of polarization of light propagating along the 
direction of the field. This phenomenon is known as the Faraday 
effect, or magnetic rotation of the plane of polarization of light. 
The angle @ of rotation of the plane of polarization is propor- 
tional to the path length of the light in the substance and to the 
strength H of the magnetic field. Thus ọ = VHI. The propor- 
tionality factor V is called the Verdet constant. It depends upon 
the nature of the substance and the wavelength of the light. 

The direction of magnetic rotation of the plane of polarization 
(for an observer looking along the magnetic field) is the same 
whether the light propagates along the direction of vector H 
or in the opposite AEn. In this feature the Faraday effect 
differs from the rotation of the plane of polarization of light 
in natural optically active media. 


CHAPTER 36 THERMAL RADIATION 


36.1 Thermal Radiation. Kirchhoff’s Law 


36.1.1 All bodies emit electromagnetic waves to one or another 
degree. Highly heated bodies, for instance, glow, whereas at 
ordinary temperatures they are the source of only invisible 
infrared radiation. 

Electromagnetic radiation, emitted by a substance and produced 
at the expense of its internal energy (9.1.2) is called thermal, 
or heat, radiation. It depends only on the temperature and the 
optical properties of the radiating bodies. If the energy consump- 
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tion of a body on thermal radiation is not replenished by sup- 
plying more heat to the body, its temperature gradually drops 
and the thermal radiation is reduced. 
Radiative heat exchange (heat exchange through radiation) is the 
spontaneous process of transferring energy in the form of heat 
from a more heated body to a less heated one. It is accomplished 
by thermal radiation and the absorption of clectromagnetic 
waves by these bodies. 
36.1.2 Thermal radiation is the only kind that can be in a state 
of thermodynamic equilibrium (8.3.3) with matter. In equilib- 
rium, the energy consumption on thermal radiation is compensat- 
ed for by the absorption of the 
A B same amount of energy by the 
body owing to the radiation fal- 
ling on it. Radiative (radiation) 
equilibrium is set up in an adia- 
batically closed system (i.e. one 
that does not exchange heat with 
the external medium), all bodies 
of which are at the same temper- 
i ature. 
nee ee It follows from the second law 
of thermodynamics (11.3.2) that 
radiative equilibrium is independent of the material of 
the bodies that make up the closed thermodynamically equilib- 
rium system. The volume density of the energy of radiation 
equilibrium and its frequency distribution are universal func- 
tions of the temperature. As a matter of fact, if this were no- 
so, it would be possible to take two adiabatically closed systemst 
A and B (Fig. 36.1), at the same temperature 7, = Tp, = T, 
and accomplish radiative heat exchange between them by mak- 
ing a small hole in the heatproof wall separating them. If the 


volume density of energy ES of system A is greater than that 


EŠ of system B, i.e. Ee > ES then, owing to heat exchange 


between the systems, the radiant energy in system A and its 
temperature should decrease and the radiant energy in system B 
and its temperature should increase until the values of the vol- 


ume energy densities in systems A and B become equal: EG = 
= bsi But, in this case, Ta > T’,, so that the process being 
considered contradicts the second statement of the second law 
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of thermodynamics (11.3.2). Consequently, EA cannot be great- 
er than ES. In exactly the same way, Ee cannot be greater 
than EG i.e. EG = ES, = Eà (T) is a universal function of 


the temperature. 

36.1.3 Serving as the spectral characteristic of radiative equi- 
librium is the spectral density of the volume energy denstiy of 
this radiation. Thus 


. GES 
0 (Vv, r')= dv 9 


where dE4 is the energy of radiative equilibrium with frequen- 
cies from v to v + dv enclosed within unit volume of the radia- 
tion field. The volume energy density of this field is 


Eå = \ ov, T) dv. 


Orem g 


Radiation equilibrium is isotropic, i.e. it is not polarized and 
all directions of its propagation are equally probable. The ener- 
sy dE of radiation equilibrium in vacuum at frequencies from 
y to v + dv incident in unit time on unit area of the surface of 
each of the bodies of a thermodynamically equilibrium system 
is equal to 


dR =+ Pp (v, T) av, 


where c is the velocity of light in free space (or in a vacuum). 
36.1.4 The radiant exitance* (total emissive power) of a body is 
the physical quantity Me, numerically equal to the energy of 
electromagnetic waves of all possible frequencies (or wavelengths) 
from 0 to œ** radiated in unit time from unit area of a body’s 
surface. 


* Formerly called the radiant emittance. 
** It is practically sufficient to be restricted to the range of 
frequencies and wavelengths of optical radiation (31.4.3). 


33—0186 
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The emissive power (spectral density of radiant exitance) of a body 
is the physical quantity numerically equal to the ratio of the 
energy dE radiated in unit time from unit area of the surface of 
the body by electromagnetic waves in a narrow range of fre- 
quencies from v to v -4- dv (or wavelengths in vacuum from A to 
à + da), to the width of this range: 


_ dE dE, EE 
r = ——— OF RSE r ly? 


where c is the velocity of light in vacuum. The values of ry 
(or r,) depend upon the frequency (or wavelength), temperature, 
chemical composition of the body and the condition of its sur- 
face. 

The radiant exitance of a body is related to r, and r} by the 
equations 


($9) 


ao 
M e = \ ry dv = \ ri ah. 
0 0 


36.1.5 The absorptance (monochromatic absorption coefficient) 
of a body is the dimensionless quantity a, which indicates what 
fraction of the energy of electromagnetic waves of frequencies 
from v to v + dv, incident on the surface of the body, is ab- 
sorbed by it. Thus 


dE abs 
dE ine 


The value of a, depends upon the frequency, temperature, chem- 
Sap composition of the body and the condition of its sur- 
ace. 

A blackbody (denoted by a superscript asterisk) is one that com- 
pletely absorbs all incident radiation, regardless of its direc- 
tion, spectral composition and polarization, without reflecting 
or transmitting any: a* = 1. Though a blackbody is a theoreti- 


cal ideal, it can be nearly realized in practice by an almost com- 
plete enclosure with a small opening (Fig. 36.2). Light entering 
the enclosure through hole O undergoes repeated reflection from 
the walls. At this the energy of the incident light is practically 
completely absorbed by the walls of the enclosure regardless 


Gy <í. 
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It: 


of the material of which they are made. The emissive power of 
a blackbody is denoted in the following by rë (or rž) and its 
radiant exitance by M¥. 

A greybody is one whose absorptance is less than unity and is 
independent of the frequency (or wavelength) of the light, the 
directions of its propagation and polarization: 


aR" 
ov 


36.1.6 According to the principle of detailed balance any micro- 
scopic processin an equilibrium system should proceed at the 
same rate as the reverse process. This prin- 

ciple of statistical physics enables a rela- 

tion to be found between the emissive 

power r, and the absorptance a, of any 

opaque body. Assume that the body is part 

of a thermodynamically equilibrium sys- 

tem at the temperature 7. The energy ra- 

diated per unit time from unit area of the h, 
surface of the body being considered in 

the frequency range from v tov + dvis Fig. 36.2 
dEraq = Tydv. During the same time, the i 

same portion of the surface of the body ab- 

sorbs a part of the energy of the radiation equilibrium (36.1.3) 
incident on this'surface. It is equal to dE aps = dy (c/4) p (v, T)dv. 
Since, according to the principle of detailed balance, dE;aq = 
= dEahs, 


=0 and Sarr. 


J 


This equation expresses Kirchhof’ s law (for radiation), which 
states that the ratio of the emissive power of a body to its ab- 
sorptance does not depend upon the nature of the body and is 
equal to the emissive power r* of a blackbody at the same tem- 
perature and frequency values. 

The dependence of rž on v and T is called Kirchhoff's function: 


ré=f(v, T= ely, T) 


33% 


5416 THERMAL RADIATION [86 


36.1.7 It follows from Kirchhoff’s law that the radiant exi- 
tance (36.1.4) of a body is equal to 
M e = \ Ayri dv. 
0 
In particular, the radiant exitance of a greybody is 


oo 
r * f 
Me = a8™Me, where Mz = \ rgav, 
0 


is the radiant exitance of a blackbody at the same temperature. 
For a nongrey body 


= BM*, 


where P is the total degree of blackness of the body. It depends 
on the material of the body, the condition of its surface and the 
temperature. For all bodies except a blackbody, B < 1. 

36.1.8 Radiation equilibrium at the temperature T is identical 
to thermal radiation of a blackbody at the same temperature. 
Therefore, it is often called blackbody radiation. The relation 
between the radiant exitance of a blackbody and the volume den- 
sity of the energy of blackbody radiation is of the form 


c c 
M* = TA \ o(v, T)dv. 
0 


36.2 Stefan-Boltzmann and Wien Laws 


36.2.4 The Stefan-Boltzmann law states that the radiant exi- 
tance of a blackbody is proportional to the fourth power of its 
absolute temperature. Thus 


M* = of, 


where o — 5.67 X 10-8 W-m-2-K-* is the Stefan-Boltzmann 
constant. This law can be derived theoretically by considering 
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the radiative equilibrium in an enclosed cavity by thermody- 
namic methods. 

36.2.2 The dependence of the emissive power r¥ of a blackbody 
on the frequency v for several constant values of the temperature 
is shown in Fig. 36.3. In the region of low frequencies, rë & 
X y2T, whereas for the region of high frequencies (the right- 
hand branch of the curves far away from the maxima) rë & 
X y3 exp [—a,v/T], where a, is a constant factor. 

The radiant energy of a blackbody is nonuniformly distributed 
in its spectrum. A blackbody displays almost no radiation in 


Y 
h% 


O Wee 
I No m3 Y O An Aim Am, A 


Fig. 36.3 : Fig. 36.4 
the regions of very high and very low frequencies. As the tem- 
perature is raised, the maximum of rž is shifted toward the 


higher frequencies in accordance with the law: v,, = 6,7, where 
Vm is the frequency corresponding to maximum rž at the tem- 


perature T, and b, is a constant factor. R 
The dependence of the emissive power of a blackbody rž = 
= cr¥/M2 (36.1.4) on the wavelength A is shown in Fig. 36.4. 
As the temperature of the blackbody is raised, the maximum of 
rx is shifted toward the shorter wavelengths in accordance with 
the Wien displacement law: 

D 

T’ 

where b = 2.9 X 10 m-K is the Wien constant. 


Ke = 
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36.2.3 All attempts that were made failed to theoretically sub- 
stantiate the experimentally found form of Kirchhoff’s func- 
tion, C= f (v, T), shown in Fig. 36.3, within the framework 


of classical physics. Using thermodynamic methods, the Wien 
formula was found: 


r$ =v (+), 


where ọ (v/T) is an unknown function of v/T. 

Rayleigh and Jeans applied the laws of electrodynamics and 
equipartition of energy in an equilibrium system (10.6.4) to 
obtain the Rayleigh-Jeans law: 


: 2 
rž == a kT 
y 2 ’ 


where k is Boltzmann’s constant (8.4.5). 

The Rayleigh-Jeans law agreed with experimental data only 
in the region of low frequencies. Moreover, it leads to the ab- 
surd conclusion that at any temperature the radiant exitance 
M* of a blackbody and the volume density of energy £4 of 
radiation equilibrium are infinitely large. This conclusion, ar- 
rived at by classical physics in the problem of explaining the 
spectral distribution of radiation equilibrium, has been aptly 
named the “ultraviolet catastrophe’. 


36.3 Planck’s Formula 


' 36.3.4 The volume density of energy of radiation equilibrium 
(blackbody radiation) in a closed cavity, as well as the energy 
distribution of this radiation in frequency are independent of 
the material of the cavity walls and are fully determined by the 
temperature. Therefore, an infinite system of harmonic oscilla- 
tors with all possible natural frequencies can be taken as a theo- 
retical model of a blackbody. Each of such oscillators corresponds 
to a monochromatic component of blackbody radiation. Let 
(Ey) be the average energy of an oscillator with the natural 
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frequency v. Then, according to calculations, the radiant exi- 
tance of a blackbody is 


2mv2 
c2 


(Ey). 


If kT is taken for (Z,,), as follows from the classical law of the 
cquipartition of energy (10.6.4)*, then the preceding expression 


for ry coincides with the Rayleigh-Jeans law (36.2.3). 
36.3.2 Planck was able to find the correct expression for the 
average energy (E, of an oscillator and for the Kirchhoff 


function by introducing the quantum hypothesis, which was en- 
tirely alien to classical physics. It is assumed in classical phys- 
ics that the energy of any system can vary in a continuous man- 
ner, i.e. that it can take any arbitrarily close consecutive val- 
ues. According to Planck’s quantum hypothesis, the energy 
Ey of an oscillator can have only definite discrete (quantized) 
values, equal to a whole number of elementary portions, called 
energy quanta Ey. Thus 


Ey = nEy, Where n = 0, 1, 2,.... 


If it is assumed that-the oscillator distribution in possible dis- 
crete energy states is described by the Boltzmann distribution 
law (10.4.2), the probability pn of finding the oscillator in a 
state with the energy nE, at the temperature 7 is equal to 


a 
ry 


Pn = C exp [—nE,, /kT}. 


Here k is the Boltzmann constant (8.4.5) and C is a constant 
factor determined from the normalization condition: 


S| Dat. i.e. Dee ee, 
n=0 N) exp [—nEyo/kT] 
n=0 


* One oscillatory degree of freedom of an oscillator accounts for 
twice as much energy, on an average, as one degree of transla- 
tional or rotary motion, because the oscillator has not only 
kinetic, but potential energy as well. On an average, this poten- 
tial energy is equal to the kinetic energy. 
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The average value of the oscillator energy is 


e ` n exp [—nEy/kT] 

(Ey) = >; Pnn Eyi = Ev <= 
n=0 oy exp [| —nEyy/kT] 

n=0 


from which 


-E 5) exp (—n8) 
n=0 


>} exp (— n$) 


n=0 


(Ey) = — Ew 


d 2 
=E A In > exp (— ný), 


n=0 
where = Ey)/kT. Since 
= : 1 
2 an en 
n=l 
and 


—In y exp (—n&)=I1n (16-5), 
n=0 
then 


[36 


Comparing this expression for r* with the Wien formula (36.2.3), 


it follows that a quantum of energy is equal to 
E, = hy, 


Vo 
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where h is a universal constant, called Planck’s constant (Appen- 
dix II). 
Planck’s radiation formula for the emissive power r* of a black- 
body is 


2Itv2 hv 
c? ghv/kt L4 


x — 
ry 


Accordingly, the spectral density of radiation equilibrium (36.1.3) 
is equal to 


8nay? hv 
a ar Lee 
36.3.3 At low frequencies (hkv< kT), exp [hv/kT — 1] = 
x hv/kT and the Planck radiation formula coincides with the 
Rayleigh-Jeans law (36.2.3). 
Following from Planck’s radiation furmula is the Stefan-Boltz- 
mann law (36.2.1): | 


oo 


27h v3 dv 
ge: : 
Me T p? ) ePYIET A 
= 2nk4T4 o nè dy = 20°k4 n 
c2h3 en—1 15¢2h3 
0 


The Planck constant is related to the Stefan-Boltzmann con- 
stant o (36.2.1) as follows: 


3 / 97554 
h= V TE 062 x 10784 J-s. 
36.3.4 The Planck formula for the emissive power rf of a black- 


body is 


27c2 he 1 
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The wavelength Àm corresponding to maximum rj is determined 
from the transcendental equation 


ze” — 5e% 4+- 5 = 0, where z = he/A, kT. 


The root of this equation z = 4.965, and Àn satisfies the Wien 
displacement law (36.2.2): 
he 
—~ | Thorn ey -3 ż 
Am? =b, where b 7965 k 2.9X 10-3 m-K, 


36.4 Optical Pyrometry 


36.4.1 Optical pyromeiry is the name given to miscellaneous 
techniques for measuring high temperatures based on the laws 
of thermal radiation. The instruments used for this purpose are 
called pyrometers. 

Radiation pyrometers record the integrated radiation of the hea- 
ted body being investigated; optical pyrometers record the ra- 
diation of a heated body in one or two narrow portions of the 
Spectrum. 

36.4.2 The radiant flux D, is the average power of optical radia- 
tion (31.4.3) during a time considerably greater than a period 
of oscillation of the electromagnetic field of the light. 

The irradiance E (radiant flux density) of a surface is the ratio 
of the flux of incident radiation on this surface to its area. Thus 
E. = d@,/dS, where d®, is the radiant flux incident on a por- 
tion of the surface of area dS. 

The radiant intensity I, of a radiative body is the radiant flux 
emitted by the body within a unit solid angle. Thus I, = 
= d®D,/dQ, where d@, is the radiant flux within the solid angle 
dQ. 

36.4.3 The radiance L, of the portion dS of a radiative surface 
in a given direction is the ratio of the radiant intensity dl, 
of area dS in the direction being considered to the area of the 
projection of dS on a plane perpendicular to the direction. Thus 


dle 


fe ees 
e dS cos p ’ 


where @ is the angle between the specified direction and the 
normal to the surface of area dS. 
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The spectral density of the radiance is the ratio of the radiance 
dLe, corresponding to a narrow band of the optical spectrum, 
to the width of this band. Thus 


dh 


36.4.4 A source of optical radiation is said to obey Lambert's 
law, or to be a cosine radiator, if its radiance Le, as well as its 
spectral densities b, and b, are the same in all directions, i.e. 
are independent of angle ọ. A blackbody could be called a cosine 
radiator. 

The radiance of a cosine radiator and its spectral densities are 
related to its radiant exitance and its spectral densities (36.1.4) 
as follows: 


by oe and by = 


M r ry 
£ , w= — and b =—* 
IU IU x 


Le = 


36.4.5 In optical pyrometry, distinction is made between ra- 
diation, brightness and colour temperatures of a body. 

The radiation temperature T, of a body is the temperature of 
a blackbody at which its radiance Z% is equal to the radiance 
Le of the given body. If the body being investigated is a cosine 
radiator, whose total degree of blackness is B (36.1.7), then, 
from the condition that Le (T) = L* (T,), where T is the true 


temperature of the body, it follows that 


Ai 
VB 


36.4.6 The brightness (luminance) temperature Th of a body is 
the temperature of a blackbody at which its spectral density 
b of radiance fora definite wavelength A, is equal to the spec- 
tral density b, of radiance for the given body for the same wave- 
length: ba (Ag, T) = bE (Ap, Th). Usually Ao =~ 660 nm (red 
light). 

lor a cosine radiator, whose absorptance (36.1.5) for light of 
the wavelength A, at the temperature T of the body is equal to 


BM*(T)=M*(T,) and T=TE > Tr. 
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ay (Ay, T), it follows from Kirchhoff’s law (36.1.6) and from 
Planck’s formula (36.3.4) that 


hy (Ag, T) rx (Ao, T) = rž (Ag; Tp) 


and 
i d: l a. 
; T 3 Y sk -ee eine | et ; as: ae eee, 3 
Ar (Ay, T) | exp ( Th exp ( nT 4 


where a, = he/k. Since a (àp, T)< 1, then T> Tp. 

36.4.7 The colour temperature Te of a body is the temperature of 
a blackbody at which the relative distributions of the spectral 
density of the radiance b¥ of the blackbody and that b, of the 


body being considered are close to one another to the maximum 
degree in the visible region of the spectrum. Thus 
by (Ay, T) bX (Ay, Te) 


Py Oar FY Of Og, Fe) 
For a cosine radiator 
ra (A,, T) ry (Ay, To) 


ra (he, D) r* (Ag, Te) ° 


As a rule, A, = 655 nm (red light) and A, — 470 nm (green 
light). The colour temperature of a greybody (36.1.5) coincides 
with its truc temperature and can be found by Wien’s displace- 
ment law (36.2.2). 


CHAPTER 37 FUNDAMENTALS OF QUANTUM 
OPTICS 


37.1 External Photoelectric Effect 
(Photoemissive Effect) 


37.1.1 Quantum optics is the branch of optics that studies phc- 
nomena in which the quantum properties of light are manife- 
sted. Such phenomena include thermal radiation (36.1.1), the 
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photoelectric effect, the Compton effect (37.3.1) and photo- 
chemical processes. 

The photoelectric effect in gases consists in the ionization of the 
atoms and molecules of the gas due to the action of light, and 
is usually called photoionization. 

In condensed bodies (liquids and solids) distinction is made 
between the external and internal photoelectric effects. 

The external photoelectric effect (photoemission) is the emission 
of electrons by a substance subject to the action of light. Elec- 
trons ejected from the substance in the external photoelectric 
effect are called photoelectrons, and the electric current formed 
by them in their ordered motion in the external electric field 
is called the photoelectric current. 

The internal photoelectric effect is the redistribution of electrons 
among the energy states in solid and liquid semiconductors and 
dielectrics due to the action of light. It is manifested in the 
change in the concentration of current carricrs in a medium 
(20.4.4) and leads to photoconduction or the barrier-layer pho- 
toelectric effect. Photoconduction is an increase in the electrical 
conductivity of a substance due to light falling on it. The bar- 
rier-layer photoelectric effect is the generating of an emf (photo- 
emf) by the action of light in a system consisting of a semicon- 
ductor and metal in contact or two different kinds of semicon- 
ductors [for example at a p-n junction (41.11.7)]. 

37.1.2 Shown in Fig. 37.1 is an installation for investigating 
the external photoelectric effect in metals. Light enters through 
window D and falls on the surface of cathode K, which is inside 
a vacuum tube and is called the photoelectric cathode. The shape 
of the curve, representing the dependence of the photoelectric 
current J in the tube on the potential difference U between the 
anode A and cathode K upon constant irradiance F, of the cath- 
ode (36.4.2) by monochromatic light, is shown in Fig. 37.2. 
The existence of a photoelectric current at negative values of U 
from 0 to —U, indicates that the photoelectrons have a certain 
initial velocity when emitted from the cathode, and the corre- 
sponding kinetic energy. The maximum initial velocity vmax of 
the photoelectrons is related to the retarding voltage (retarding 
potential) U, by the equation 


mv? 
max — elp, 


2 
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where e and m are the absolute values of the charge and the mass 
of the electron. 

The photoelectric current increases with U only to a definite 
limiting value J, called the saturation photoelectric current. At 
this value, all the electrons escaping from the cathode, due to 


A 
A— 
—> 
“Up 0 U 
Fig. 37.1 Fig. 37.2 


the effect of light, reach the anode. If nsec is the number of pho- 
toelectrons emitted from the cathode per second, then 


Is = ensec. 


o The external photoelectric effect obeys the following 
aws. 

1. The Stoletov law slates that at a constant spectral composition 
of the light incident upon the photoelectric cathode, the satu- 
ration photoelectric current is proportional to the irradiance 
(36.4.2) of the cathode: 


Is X Ee and nsec X Ee 


2. For a given photoelectric cathode, the initial velocity of the 
photoelectrons depends upon the frequency of the light, but 
not on its intensity. 

3. A photoelectric threshold exists for each photoelectric cathode, 
i.c. the minimum frequency v, of light at which the external 
photoelectric effect is still feasible. This frequency vg depends 
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upon the material of the photoelectric cathode and the condition 
of its surface. 

The second and third laws cannot be interpreted on the basis 
of the classical electromagnetic theory of light. According to 
this theory the removal of conduction electrons from the metal 
is a result of their “swaying” in the electromagnetic field of the 
light wave so that they are “loosened”. This effect should increa- 
se with the light intensity and with the irradiance of the photo- 
electric cathode, proportional to the intensity. 

37.1.4 Only the quantum theory of light enabled the laws of 
the external photoelectric effect to be properly explained. In 
a further development of Planck’s ideas on the quantization 
of the energy of atoms (oscillators) (36.3.2), Einstein proposed 
the hypothesis that light is not only radiated, but also propa- 
gates in space and is absorbed by matter in the form of separate 
discrete quanta of electromagnetic radiation, which were nam- 
ed photons. All photons of monochromatic light of the frequen- 
cy v have the same energy Ey, = hv, where k is Planck's constant, 
and travel in space at the velocity c of light in vacuum (or in 
free space). In the absorption of light each absorbed photon 
transmits all of its energy to a particle of the absorbing sub- 
stance. In the external photoelectric effect, for example, a con- 
duction electron of the metal, in absorbing a photon, receives 
all of its energy hv. The work function ® (41.11.1) is the mini- 
mum energy that it is necessary to impart to the electron for 
it to escape from a metal. Hence, Finstein’s photoelectric emis- 
sion equation, expressing the law of the conservation of energy 
in the photoelectric effect, is of the form 


mv? 
hw=0 -+ —S—, 


37.1.5 The second law of the photoelectric effect follows directly 
from Einstein’s equation: 


mv2 


Thus, vmax and U, depend only upon the frequency of the light 
and the work function in emitting an electron from the photo- 
electric cathode. 
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The maximum initial kinetic energy of the photoelectrons de- 
pends upon the light frequency according to a linear relation. 
It becomes zero at the frequency vy, corresponding to the pho- 
toelectric threshold. Thus 


Consequently, the photoelectric threshold depends only on the 
electron work function of the metal. 

37.1.6 The photoelectric effect is inertialess, i.e. the emission 
of photoelectrons begins immediately, as soon as light with the 
frequency v > v, falls on the photoelectric cathode. This. feature 
of the external photoelectric effect is still another confirmation 
of the quantum nature of interaction between light and matter. 
According to classical wave conceptions, quite considerable 
time is required for an electromagnetic wave of given intensity 
to impart sufficient energy to the electron for its emission from 
the metal. 

37.1.7 At very high intensities of light, achieved by means 
of lasers (39.8.8), a multiple-photon, or nonlinear, photoelectric 
effect is observed. In this case, the electron can simultaneously 
receive the energy of N photons rather than that of one photon. 
Then the equation of the law of conservation of energy in the 
external photoelectric effect under the action of light of frequen- 
cy v is of the form 


mv? 
Nhiyx=04+ — 5. 


The photoelectric threshold of the N-photon photoelectric effect 
is 


37.2 Mass and Momentum of the Photon. 
Light Pressure 


37.2.1 The mass of the photon (37.1.4) having the energy Eph = 
— hy, where v is the frequency of the light and h is Planck’s 
constant, can be found from the mass-energy relatian (5.7.2). 
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Thus 


hv 


Mph S5-a , 


where c is the velocity of light in free space (or in a vacuum). 
The photon always travels with the velocity c, and its rest mass 
(5.6.1) equals zero. 

37.2.2 The momentum Ppn of the photon is numerically equal to 


pe eect ee hv h 
Pph >= pht == € aa ee 


where A is the wavelength of the light in vacuum. Since the 
wavenumber is k = 2n/A, 


h 
Pph =a k= hk and Pph = Ahk, 


where i = h/2n, and k is the wave vector (30.2.7). 

37.2.3 Light exerts a pressure on the bodies that reflect or ab- 
sorb it. In quantum optics, light pressure is interpreted as the 
result of the transfer of momentum to bod- 
ies by photons as the light is reflected or 
absorbed. The light pressure p on the plane 
surface ab of a body (Fig. 37.3) is equal to 
the numerical value of the normal compo- 
nent of the total momentum transferred 
by photons to the body on unit area of the 
surface being considered during unit time. 
Assume that monochromatic light of fre- 
quency v is incident upon surface ab at the Fig. 37.3 

angle i (Fig. 37.3) and that nsec is the 

number of photons incident per second upon unit area of sur- 
face ab. If R is the coefficient of reflection (31.5.6) of light from 
the surface being considered, then, of the nsec photons, Rngec 
are reflected and (1 — R) nsec are absorbed. The photons being 
reflected transfer a total momentum to the body, directed normal 
to the surface ab and numerically equal to 


V . 
Rnsec Cos i 


34—0186 
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The photons being absorbed transfer a total momentum to the 
body, whose component normal to surface ab is numerically 
equal to 


h 
(1 == R) nsec Z cosi. 


Thus the light pressure equals 


h 
p = Rnsec = cosi + (1— R) Nsec A eosi 


= (1+ R) nsec * cos i. 


If nọ is the photon concentration of the incident light, then 
nsec = noc Cosi and n,hv = (E4%) is the average value of the 
volume energy density of the light. Hence 


p = nohy (1 + R) cos? i = (Ed) (1 + R) cos? i. 


Thus, the pressure of light can be equally well explained by 
either wave theory (31.2.6) or qùantum theory. Da 


37.3 Compton Effect 


37.3.1 The Compton effect is the change in the wavelength of 
X rays when they are scattered by a substance containing light- 
weight atoms. The wavelength i’ of the radiation, scattered at 
the angle 0 to the direction of propagation of the primary mono- 
chromatic radiation of wavelength A, islonger than A by the 
amount AA, which depends only on angle 0. Thus s 


E E OY We sin? È, 


The constant Ac = 2.43 Xx 10-12 m is called the Compton wave- 
length of the eleciron. The Compton effect cannot be explained 
on the basis of the classical wave theory of light. N 

37.3.2 According to quantum theory, the Compton effect .is 
the result of elastic collision of the X-ray photon with a free or 
effectively free electron (in light atoms, the binding energy of 
an electron with the atom is substantially less than the energy 
of an X-ray photon). In this collision the photon transfers a 
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part of its energy and a part of its momentum to the electron 
in accordance with the laws of the conservation of energy and 
momentum. If the electron was ini- 

tially at rest, it follows from the law mU 

of conservation of energy that i 


hv + moc? = hv’ + me, (a) 


where v = c/A and v’ = c/d are the 
frequencies of the incident and scat- 
tered radiation, myc? is the rest ener- p 

gy of the electron (5.7.3), and mc? ph 

is the total energy of the electron Fig. 37.4 

(5.7.2) after the collision (energy of 

the Compton recoil electron). 

It follows from the law of conservation of momentum that 


Pph = mv H Pon 


or, in accordance with Fig. 37.4, 


mop = (2) 4 (== h2yy’ cos 0, (b) 


c c c? 


where v is the velocity of the Compton recoil electron. | | 
The mass of the ‘recoil electron is related to its velocity v by 
the equation (5.6.4) 


Mg 


"= Tao 


Combining equations (a), (b) and (c), we obtain 


(c) 


moc? (v — v’) = 2hvv’ sin? 7 

or 

Maa ein 
Myc 2 
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Thus, the Compton wavelength of the electron is 
= h 


C moe’ 
37.3.3 The kinetic energy of the recoil electron (5.7.1) is 


Ey = (m— my) c2=h (v—v’)=hvv’ (——— = er (My 
or 
0 
saa 
P " An m 2a sin 9 
bo a ee 
Aaa 1-|-2a sin? 


where a = Ac/A, and hv is the energy of the incident photon. 
The energy Ek is maximal at 0 = x. Thus 


2ahv 
Ek max = 79, ° 


37.3.4 When an electron is tightly bound to its atom, a photon 
scattered by the electron transfers encrgy and momentum, not 
to the electron, but to the atom asa whole. The mass of the atom 
is many times greater than that of the electron. Hence, only 
a small part of the photon’s energy is transferred to it, so that 
the wavelength A’ of the scattered radiation does not differ prac- 
tically from the wavelength A of the incident radiation. The 
percentage of electrons that are tightly bound in their atoms 
increases with the atomic number of the element and, conse- 
quently, with the atomic mass. Therefore, the heavier the atoms 
of the scattering substance, the greater the relative intensity 
of the unshifted component (A’ = A) in the scattered radiation. 
37.3.5 In contrast to the scattering of photons, accomplished 
by both free and bound electrons, only bound clectrons can ab- 
sorb photons. For example, in the external photoelectric effect, 
a photon is absorbed by a bound electron, which expends a part 
of the energy it receives on the work function (i.e. in doing work 
to escape from the substance), the work function being a mea- 
sure of how strongly the electron was bound in the substance. 
The absorption of a photon by a free electron is impossible, 
because this process would contradict the laws of conservation 
of energy and momentum. This can be readily shown by consider- 
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ing the absorption of a photon by a stationary free electron. 
It follows from the laws of conservation of energy and momen- 
tum that the following two relations must be simultaneously 
complied with: 


ee 
y 1— (0/0)? ` 


But these equations are simultaneous only at v = 0. 


hy 
(m — m c2=hv and mv = = where m = 


37.4 Wave-Particle Duality of the Properties of Light 


37.4.1 Such phenomena as interference (30.5.2) and diffraction 
of light (33.2.1) are convincing proof of the wave nature of 
light. At the same time, the laws of thermal radiation equilib- 
rium (36.1.2), the photoelectric effect (37.1.1) and the Comp- 
ton effect (37.3.1) can be successfully interpreted only on the 
basis of quantum concepts of light as being a flux of discrete 
photons (37.1.4). But the wave and quantum (particle) methods 
of describing light mutually complement, rather than contra- 
dict, each other, because light simultaneously possesses both 
wave- and particlelíke properties. Light is a dialectic unity of 
these opposite properties. 

37.4.2 The principal equations relating the wavelike proper- 
ties of light (frequency v and wavelength A in vacuum) to its 
particlelike properties (the energy Ep, and momentum pph 
of a photon) are: 

, h 

Lpn=kv and Pph = 7 
The wavelike properties of light play the decisive role in the 
laws of its propagation, interference, diffraction, and polari- 
zation; whereas the particlelike properties play no less impor- 
tant a role in the interaction of light with matter. The longer 
the wavelength of light, the less the momentum and energy of 
the photon, and the more difficult it is to detect the quantum 
properties of light. For example, the external photoelectric 
effect occurs only at energies of a photon greater than or equal 
to the work function in releasing an electron from a substance 
(37.1.5). The shorter the wavelength of electromagnetic radia- 
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tion, the higher the energy and momentum of the photons, and 
the more difficult it is to detect the wavelike properties of this 
radiation. For example, X rays are diffracted only by an 
extremely “thin refraction grating, the crystal lattice of a 
solid (33.3.4). 

37.4.3 Quantum optics employs the statistical approach in 
dealing with the laws of light propagation. According to this 
approach, the diffraction of monochromatic light by some ob- 
stacle (for instance, by a diffraction grating) consists in the re- 
distribution of the photons in space caused by this obstacle. 
The probability that the photons will hit various points on 
a screen mounted behind the obstacle is not the same. This ex- 
plains why a diffraction pattern appears on the screen: The 
irradiance EF, (36.4.2) of some small portion dS of the screen’s 
surface is proportional to the number dns,, of photons incident 
upon this portion (dS) per <econd. Consequently, Ee is propor- 
tional to the probability of photons hitting unit area of the 
screen's surface at the point being considered. On the other 
hand, according to wave conceptions, Ee is proportional to the 
square of the amplitude A of the light at the same point on the 
screen. Hence, the square of the amplitude of the light wave 
at some point in space is a measure of the probability of a pho- 
ton being at this point. 

37.4.4 Experiments in light diffraction indicate that in chang- 
ing the intensity of a light flux incident on an obstacle, the 
appearance of the diffraction pattern, i.e. the relation between 
the irradiances at various points of the screen, remains un- 
changed. This indicates that wavelike properties are inherent in 
each separate photon, and not only in an assembly of a great 
number of simultaneously travelling photons. As a photon pas- 
ses through an optical system it is impossible to indicate the 
particular point of the screen it will hit. It is feasible only to 
speak of the probability d#4 that the photon will hit some small 
portion dS of the surface of the screen. 


PART SIX ATOMIC 
AND MOLECULAR PHYSICS 


CHAPTER 38 ELEMENTS OF QUANTUM MECHANICS 


38.1 Wave-Particle Dualism of the Properties 
of Particles of Matter 


38.1.1 Quantum mechanics* deals with the physics of atoms, 
molecules and their associations, crystals in particular, as well 
as atomic nuclei and elementary particles.. Objects of the micro- 
cosm studied by quantum mechanics have linear dimensions 
of the order of 10-* to 10-13 cm. If the particles travel at veloci- 
ties v< c, where c is the velocity of light in free space, nonrel- 
ativistic quantum mechanics is applied; at v<e, relativistic 
quantum mechanics is resorted to.** 

38.1.2 Quantum mechanics is based on Planck’s concepts con- 
cerning the discrete nature of changes in the energy of atoms 
(36.3.2), Einstein's ideas on photons (37.1.4), and data on the 
quantum nature of certain physical quantities (momentum and 
energy, for instance) that characterize, under definite condi- 
lions, the states of particles of the microcosm. 

38.4. 3 The basic concept of quantum mechanics is that the wave- 
particle dualism of properties, established for light (37.4.1), 
is of a universal nature. It should manifest itself for any parti- 
cles having a momentum p. All particles with the finite momen- 
(um p possess wave properties and their motion is accompanied 
by a certain wave process. 


* Also called wave mechanics in its early stages of develop- 
ment. 

** Information on relativistic quantum mechanics is beyond 
the scope of the present handbook. Wherever applied, the term 
quantum mechanics refers to nonrelativistic quantum mechanics, 
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38.1.4 The de Broglie formula establishes the dependence of 
the wavelength associated with a moving particle of matter 
on the momentum p of the particle. Thus 

h h 


=.= 


p mv ’ 
where m is the mass of the particle, v is its velocity, and h is 
Planck’s constant (Appendix II). The waves mentioned here 
are called de Broglie, or matter, waves. 
Another form of the de Broglie formula is 


h 


where k = 2mnn/À is the wave vector, whose magnitude k = 
= 2n/X is the wavenumber, i.e. the number of wavelengths 
per 2x units of length, nis a unit vector in the direction of wave 
propagation, and fA = h/2n = 1.05 x 10-84 J-s. 

38.1.5 The de Broglie wavelength for a particle of mass m, 
having the kinetic energy Ek (3.2.1), is 


fee 

V 2mE, i 

In particular, for an electron accelerated in an electric field 
with a potential difference of AV volts (16.2.7), 


‘i = 


38.1.6 The de Broglie formula is confirmed by experiments in 
the scattering of electrons and other particles by crystals and 
in the passage of particles through matter. An indication of 
the wave process in all such experiments is the diffraction pat- 
tern observed, showing the distribution of the electrons (or other 
particles) in the particle detecting devices. 
No wave properties are manifested by macroscopic bodies. The 
de Broglie wavelengths for such bodies are so short that it 
proves impossible to detect any wave properties. 
38.1.7 The phase velocity (30.2.10) of de Broglie waves is 

W c? c? 


v = — — — = — mÀ 
ph k V h : 
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where m is the mass of the particle, v is its velocity, and A is 
the de Broglie wavelength. Since c œ v, the phase velocity 
of de Broglie waves is greater than the velocity of light in free 
space. The dependence of the phase velocity of de Broglie waves 
on the wavelength indicates that these waves undergo disper- 
sion (30.3.8). The group velocity (30.4.3) of de Broglie waves 
is equal to the particle velocity v. Thus 


Table 38.1 compares the particle and wave properties of parti- 
cles of mass m, travelling at the velocity v. 


TABLE 38.1 
s Wavelike properties 
: ; h h 
Velocity v De Broglie wavelength L ae 
: E me 
Momentum p= | De Broglie wave frequency v = = 
= mv 


Energy E=me? | Group velocity of de Broglie waves u=v 


Phase velocity of de Broglie waves vph= 
c2 


~ y 


38.1.8 Besides the de Broglie formula, it is assumed in quan- 
tum mechanics that the following relation exists between the 
total energy E of a particle and the frequency v of its de Broglie 
wave: 


E = Ww = ha, 


where œ = 2nv is the angular frequency (28.1.2) and A = 
= h/2m (38.1.4). 

38.1.9 De Broglie waves are of a specific nature having no analo- 
gies among the waves dealt with in classical physics. The square 
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of the magnitude of the de Broglie wave’s amplitude at a 
given point is a measure of the probability that the particle 
is to be found at this point (this is the probability, or statistical, 
meaning of de Broglie waves). The diffraction patterns, observed 
in the experiments mentioned in Sect. 38.1.6, are an indication 
of statistical laws. According to these laws the particles, reach 
definite places in the receivers, these being the places where 
the intensity (30.3.6) of the de Broglie wave is found to be 
highest. No particles are observed at the places where, accord- 
ing to the statistical interpretation, the square of the magnitude 
of the “probability wave” amplitude becomes zero. 


38.2 Schrédinger Wave Equation 


38.2.1 The position of a particle in space at a given instant 
of time is determined in quantum mechanics by specifying its 
Schrédinger wave function (also called the psi function or the 
probability amplitude) rp (x, y, z, 1). The probability dw that 
the particle is within the element dV of volume is proportional 
to | |? and to the element dV of volume. Thus 

dw = | 4 |? aV, 

where | 1p |? is the square of the magnitude of the psi function: 
| p |? = ynp*. Here w* is a function that is the complex conju- 
gate of wp. The quantity | w |? is the probability density | p |? = 
= dw/dV = p and specifies the probability of the particle oc- 
cupying the given point in space. The intensity of the de Brog- 
lie wave is determined by the quantity | w [?. 

38.2.2 Following from the definition of the psi function is the 
normalization condition for probabilities: 


\ | | (pitar 4, 


v 


in which the triple integral with respect to volume -is' calcu- 
lated along the coordinates z, yand z, from —œ to-+ 0, i.e. 
throughout the whole infinite space. The normalization condi- 
tion indicates that it is a certain event that the particle is some- 
where in space and the probability of this event should be uni- 


ty. l 
38.2.3 The wave function p(z, y, z, t)is the main. character- 
istic of the state of microobjects (atoms, molecules and ele- 
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mentary particles). This function is made use of to calculate the 
average value of some physical quantity Z that characterizes 
the object which is-in the state described by the wave function 
*», Thus 


(L)=\\ È L 1p |2 dz dy dz, 


where (LY is the average value of L and integration is carried 
out as indicated in Sect. 38.2.2. 

38.2.4 The fundamental differential equation of quantum me- 
chanics*, concerning the wave function p (z, y, z, #) is called 
the Schrödinger time-dependent wave equation. It determines 
the psi function for microparticles in a field of force with the 
potential energy U (z, y, z, t) (3.3.1) at the velocity v<& c, 
where c is the velocity of light in free space. The Schrödinger 
equation is of the form 


,§ 2) _ _ , 
th Ot om Ap+U (zx, Y, 2, t) p, 


where A is the Laplace operator, m is the mass of the particle, 


h = h/2n, h is Planck’s constant, and i= V —1. 

The Schrödinger equation is supplemented by the following 
conditions that:are imposed on the psi function: 

(a) the psi function should be finite, single-valued and contin- 
uous; 

(b) the partial derivatives @p/dx, ap/dy and dp/dz should be 
continuous; 

(c) the function |p [? should be integrable, i.e. the integral 
+00 


\ \ \ | p |? dV should be finite. In the simplest cases, this con- 


dition is reduced to the normalization of probabilities (38.2.2). 
38.2.5 In the case when the psi function is time-independent, 
i.e. p= (z, y, z), it satisfies the Schrodinger steady-state 


* See the second footnote to Sect, 38.1.1, 
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equation: 
2 
Ap + 


m 

h2 (E—U) y= 0, 
where Æ is the total energy of the particle. The remaining nota- 
tion is given in Sect. 38.2.4. The psi functions satisfying the 
Schrödinger equation for the given form of U = U (zx, y, 2) 
are called eigenfunctions. They exist only at definite values of E, 
called the eigenvalues of the energy. The whole set of eigenvalues 
of E forms the energy spectrum of the particle. If U is a monotonic 
function and U — 0 at infinity, then in the region EF < 0, the 
eigenvalues of the energy form a discrete spectrum. The most 
important problem of quantum mechanics is to find the eigen- 
values and eigenfunctions of particles (or systems of particles). 
38.2.6 Schrodinger’s time-dependent equation has the solu- 
tion: 


p(z, y, z, t)=Ņ (z, y, z)e Bh, 


The state of the particle at the given instant of time is described 
by a periodic function of time with the angular frequency 
w = E/ħ, determined by the total energy E of the particle. 
This corresponds to the relation between the total energy E of 
the particle and the de Broglie wave frequency (38.1.8). 

If a particle is in a definite energy state with the energy E = 
= const, the probability dw of finding it in the element dV of 
volume is independent of time. Thus 


dw = | p |? dV = y* dV. 


This is said to be the steady state of the particle. An atom in the 
steady state has constant energy and does not radiate electro- 
magnetic waves (39.1.7). 


38.3 Motion of a Free Particle 


38.3.1 In free motion of a particle (U = 0), its total energy E 
coincides with its kinetic energy. If the OX axis is taken along 
vector v of the particle’s velocity (v = const), Schrodinger’s 
steady-state equation (38.2.4) has the following solution: 


tp = Ae- G/A) V 2mE x + Bel™) V2mE x 
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where m is the mass of the particle, A = h/2n, h is Planck’s 
constant, and A and B are certain constants. The Schrödinger 
time-dependent equation (38.2.4) has, in this case, the solution 


(fo 


i pe = 
p (£, Y, Z, t)=Ae á i + Be 


which is the superposition of two plane monochromatic waves 
(31.1.6) of equal frequency œ = E/h, propagating, one in the 
positive direction of the OX axis with the amplitude A, and 
the other in the opposite direction with the amplitude B. 
38.3.2 In quantum mechanics a free particle is described by 
a plane monochromatic de Broglie wave with the wavenumber 
k (38.1.4): 


k= V 2mE. 


The probability of finding the particle at any point in space is 
constant. For waves propagating along the positive direction 
of the OX axis, 


lap [2 = ayyp* = JA (2, 


38.4 A Particle in a One-Dimensional Infinitely 
Deep Potential Well 


38.4.1 A potential well is a region of space in which the poten- 
tial energy U of a particle is less than a certain value Umax: 
In particular, at U = U (x) and Umax = œ, there is a one- 
dimensional potential well of infinite depth. If the potential 
energy of the particle outside and inside the potential well 
has the following values (Fig. 38.1): 


U=0 at O<2e<lL, 
U=o at «<0 and zL, 
the well has a “flat bottom”. 


The motion of collectivized electrons in a metal is dealt with 
in classical electronic theory as motion in a potential well, 
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the potential energy of the electron outside the metal equalling 
zero, and that inside the metal being negative and numerically 
equal to the work function of the metal (energy required to re- 
move an electron from the metal) (41.11.1). The potential well 
illustrated in Fig. 38.1 is simpler 
VA, l than the real potential well of elec- 
i l trons in a metal. 

38.4.2 The Schrödinger steady-state 
equation (38.2.5) for particles in a 
potential well, considered in the pre- 

ceding subsection, is of the form 


ze dap , 2m Z 
2=0 CL Xx Jz? Toa Ep=0 
Fig. 38.1 at the boundary conditions p (0) = 
= 1p (L) = 0, which indicate that 
Ņp=0 and k |2 = 0 outside the region 0< z< L, i.e. 
that the probability of finding the particle outside the potential 


well is zero. 
The solution of the Schrödinger equation is 


(x) = A cos kz + B sin kz, 


where A and B are constants, and k = V 2mE/h is the wavenum- 
ber (38.1.4). It follows from the boundary conditions that A = 0, 
B Æ 0 and sin kL = 0, i.e. the wavenumber can assume a set 
of discrete values that comply with the requirement: k,l = 
= nn, where n = 1, 2, 3,.... 
The last equation means that 
OE a E, eee 
An L a gee 
A whole number of de Broglie half-waves should fit into the 
length of the potential well. 
38.4.3 Physical quantities that can have only definite discrete 
values are said to be quantized (quantized physical quantities). 
The eigenvalues of the energy En of a particle in a one-dimen- 
sional infinitely deep potential well, 

n2n2h2 


En mn eL 23 cute Vis 
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comprise a discrete set of energy values that are quantized (see 
also Sect. 38.2.5). 

The quantized values of Ey are called energy levels, and the num- 
bers n, determining the energy levels of a particle in a poten- 
tial well, are called quantum numbers. 

38.4.4 At large quantum numbers (n> 1), the energy levels 
of a particle in a potential well become relatively closer to one 
another. The relation AE/En ~ 2/n< 1, where AE = Epi, — 
— En = (2n +1) n2n?2/2mL?. The inequality AE< En at 
n> 1 indicates that the quantization of energy at large quan- 
tum numbers yields results close to those obtained in classical 
physics: the energy levels become quasi-continuous (quasi- 
continuous energy levels at n> 1). 

Bohr’s correspondence principle states that at high quantum num- 
bers the conclusions and results obtained in quantum mechanics 
should correspond to those of classical mechanics. 

A more general statement of the correspondence principle is 
that between any theory developed from a classical one and the 
initial classical theory there must be a regular relationship: 
in definite limiting cases, the new theory must go over into the 
old one. For example, the formulas of kinematics and dynamics 
of the special theory of relativity become the formulas of New- 
ton’s mechanics at velocities where (v/c)? << 1 (5.3.4). Geometri- 
cal optics is a limiting case of wave optics if the wavelength 
can be neglected (A — 0). 


38.5 Linear Harmonic Oscillator 


38.5.1. A linear (one-dimensional) harmonic oscillator is a particle 
of mass m, vibrating with the natural angular frequency wọ 
(28.1.2) along a certain axis OX due to the action of quasi-elas- 
tic force F, which is proportional to the displacement x of 
the particle from its equilibrium position. Thus F = —kz. 
Here k is a factor of the quasi-elastic force, related to m and 
©, by. the equation: k= mw? (28.2.3). The potential energy U 
of the harmonic oscillator (28.2.3) is ` 


kx? 


i) aaa 
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38.5.2 In classical physics the amplitude (28.1.3) of small vibra- 
tion of a harmonic oscillator is determined by the margin of its 
total energy E (Fig. 38.2). At points B and A, with the coordi- 
nates +a, the total energy E is equal to the potential energy: 


ES E ae where a is the amplitude of vibration of the clas- 

Wel 

| t 

U | | 

| | 

A 8 | | 

| l | | 
| oes 

a2 0 aX a eo 

Fig. 38.2 Fig. 38.3 


sical harmonic oscillator. Such an oscillator cannot get outside 
the limits of the region (—a, + a). 

38.5.3 The probability w,) (x) dx of finding the oscillator on 
the line segment from z to z + dx in accordance with classical 
mechanics, 


4 ax 
wo] (2) De a 
(1-5) 
a 


is represented by the curve in Fig. 38.3. 

38.5.4 The vibration of a linear harmonic oscillator is investi- 
gated in quantum mechanics by means of the Schrödinger 
e equation (38.2.5): 


dp nog 
dr? I P(r ) v= 9, 


The solutions of this ee complyi:rg with the conditions 
in Sect. 38.1.4, are the eigenfunctions (wave functions) of a lin- 
ear harmonic oscillator: 


Un (x) = e—5?/2 Hn (§&), E= — 


fi 0 
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where Ly= V hj2nma, H,(§) is a Chebyshev-Hermite poly- 
nomial of the nth order: 


(—1)" ade 
O = amn va aE 


The eigenfunctions for n=O, 1, 2 are: 


4 —x2/2002 
pile) = == É 
V toy x 
— 2/222 
pi (x) EEE E o slat 
V 2% Vx To 
2 —x2/2x2 
po (2) = (44-2) e a o, 
V & Vx XO 


The node of a wave function is its value equal to zero. The num- 
ber of nodes of the function pn is equal to the quantum number n 
(38.4.3). 

38.5.5 The eigenvalues of the energy En of a linear harmonic 
oscillator, 


En= (n+), m= (n+) hoy (n=0, 1, 2, 225), 


where vy = ©ọ/27, and w, is the natural angular frequency 
(38.5.1), are the set of equally spaced energy levels shown in 
Fig. 38.4. At n>> 1 (where n + 1/2 x n), the energy levels of 
the oscillator coincide with the values of quantized energy of 
the oscillator Eh = nħ@p, postulated by Planck in the theory 
of blackbody radiation (36.1.5). 

38.5.6 The minimum energy that can be had by a harmonic 
oscillator is called the zero-point energy Eo: 


E, = at = “0 (at n=0). 


In classical physics and in Planck’s theory it was assumed that 
E = 0 (at n = 0). This means that the oscillator does not vi- 
brate and isina state of equilibrium. Atoms, as oscillators, 


35—0186 
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should not, according to classical physics, execute any vibra- 
tions at the temperature of absolute zero (T = 0). It has been 


W 


Fig. 38.4 Fig. 38.5 


proved in modern quantum mechanics, that the zero-point ener- 
gy of a harmonic oscillator cannot be reduced however much it 


-6 -5-4-3 ~ 


Fig. 38.6 


is cooled, even down to absolute zero (11.8.4). Zero-point energy 
of the oscillator corresponds to its zero vibration. In quantum 
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mechanics, zero-point energy is a typical feature of any system 
of particles. At temperatures close to absolute zero, a substance 
is in a condensed state and its atoms (or molecules or ions) are 
dealt with as vibrating oscillators. Zero-point energy is the 
least amount of energy that a quantum oscillator should have 
in its lowest energy state (at 

n = 0) in order to comply -F> 

with the indeterminacy relation i 

(38.6.2). kr 

38.5.7 The probability of find- 

ing a quantum linear harmon- 

ic oscillator on the OX axis in 

the region from z to x + dz is 


Mqu (x) dz = | Pp (z) |? dz. N $$$ $$ 
Compared in Fig. 38.5 are the “a 
quantum probability density at Fig. 38.7 

n = 4 and the classical probabi- 

lity density w,) (x). The existence 

of nonzero values of wqu (x) beyond the limits of the classical- 
ly allowed region | z | >a is explained by the feasibility of 
the passing of particles possessing wavelike properties through 
a potential barrier (38.7.2). 

38.5.8 As the number n is increased, the probability distribu- 
tion curve | , |?, illustrated in Fig. 38.6 for n = 10, becomes 
nore and more like the classical probability distribution curve 
(Pig ne This agrees with Bohr’s correspondence principle 
(38.4.4). 

38.5.9 Making use of the eigenfunctions of the energy of a linear 
harmonic oscillator, the average energy (EY of such an oscil- 
lator can be calculated: 


hW RW 


Pe OO i O 
=O + xp Gia, /kT) 1 


h 

Accurate to within zero-point energy, (£) is expressed by the 
term (£,), which was obtained by Planck in developing his 
theory of the thermal radiation of a blackbody. Graphically, 
this term is represented in Fig. 38.7, at T = const, asa func- 
tion of the frequency. It is evident that the greatest contribu- 
lion to (#,) is made by vibrations with low frequencies, corre- 
sponding to long wavelengths. 


ny 
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At a high temperature (kT >> hwo) 


Og 


if. 


(Ey) == 


This result coincides with that obtained from the principle of 
equipartition of energy (10.6.4). 


38.6 Heisenberg indeterminacy Principle 


38.6.1 The wavelike properties of microparticles (38.1.4) 
introduce limitations into the possibility of applying the con- 
cepts of coordinates and momentum in their classical form to 
such particles. 

Limitations also exist in classical physics in the application of 
certain concepts to definite items. It is of no meaning, for in- 
stance, to apply the idea of temperature to a single molecule; 
the concept of point localization (location at a certain point) 
is inapplicable in determining the position of a wave in space, 
etc. But in classical mechanics, a definite value of the particle’s 
coordinate corresponds to exact values of its velocity and mo- 
mentum. In quantum mechanics there exist fundamental re- 
strictions in the possibility of simultaneously determining the 
exact coordinate of a particle and its exact momentum. These 
restrictions are associated with the wave-particle dualism of 
the properties of microparticles (38.1.3). 

38.6.2 The Heisenberg indeterminacy relations are the inequali- 
ties 


h h 
, Ay X Apy > 5 ; Az XAP >T. 
Here Az, Ay and Az are the ranges of the coordinates within 
which a particle, described by a de Broglie wave (38.1.4), may 
be localized if the projections of its momentum along the coor- 
dinate axes are within the ranges Apy, Ap, and Ap,. The Hei- 
senberg relations indicate that the coordinates z, y, and z of 
the particle and the projections py, py and pz of its momentum 
on the corresponding axes cannot simultaneously have values 
exactly equal to z and px, y and Py, and z and p,. These physical 


v| = 


Az X Apy > 
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quantities can have values given to an accuracy determined by 
the Heisenberg relations. The more accurately the position of 
the particle is determined, i.e. the smaller the values of Az, 
Ay and Az, the less accurately the values are determined for the 
projections of its momentum (i.e. the greater are Ap,, Ap, 
and Ap,). If the position of the particle along the OX axis is 
exactly determined and Az = 0, then Ap, = oo and the value 
of the projection p, of the momentum on this axis becomes en- 
tirely indeterminate. 

38.6.3 The indeterminacy relations put definite restrictions in 
quantum mechanics on the feasibility of describing the motion 
of a particle along a certain path. 

In classical theory each point of the path of a particle has de- 
linite coordinates z, y, and z and the particle has a definite mo- 
mentum p with projections p,, py and p; along the coordinate 
axes. This is realized in quantum mechanics only in cases when 
the particle travels in a macroscopic region of space (for instance, 
the particle may leave a trace on a photographic plate or 
on the screen of an oscillograph). If, for example, the position 
of an electron is fixed with an accuracy determined by the linear 
dimensions of a grain of photographic emulsion, subject to the 
effect of the electron, then Az ~ 10- m. This corresponds to 
an indeterminacy in the momentum equal to Ap, > f#/2Ar zw 
~ 10-28 kg-m/s and in the velocity equal to Av, = Ap,/m % 
x 10? m/s. This indeterminacy at electron velocities of the order 
of 108 to 107 m/s,enables us to assume that the electron is trav- 
elling along a definite path with an exact velocity specified 
at each point of this path. 

38.6.4 If the particle is travelling in a microscopic region of 
space, the indeterminacy relations substantially affect the nature 
of its motion. For example, the position of an electron trav- 
clling in an atom can be determined with an accuracy to the 
dimensions of the atom, i.c. Ax ~ 10-1} m. But the indeter- 
minacy of the electron’s velocity Av,., in this case, is found to 
be of the same order of magnitude as the velocity itself: Av, ~ 
x 10° m/s ~ v. The path of an electron in an atom with the 
exact velocity specified at each point does not have the meaning 
it has in classical mechanics (1.2.3). This by no means indicates 
(hat the indeterminacy principle places fundamental limitations 
on our knowledge of the microcosm. It only indicates the limited 
applicability of the concepts of classical physics to the region 
of the microcosm. 
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38.6.5 No restrictions are made by the indeterminacy principle 
with respect to the feasibility of applying the conception of the 
coordinate and momentum of a macroscopic body in the classi- 
cal sense. The wave properties of such bodies are not manifested 
in any way (38.1.6) and, consequently, the indeterminacy rela- 
tions are not-applied to them. 

38.6.6 The indeterminacy relations for energy E and time ¢ are 


AE X At> ii, 


where AE is the indeterminacy of the energy of a particle, which 
is in a state with the energy E during the length of time At. 
The longer a particle is in the given state, the more accurately 
its energy in this state can be determined. 

38.6.7 Measurement is the process of interaction of an instru- 
ment with the item being investigated as a result of which cer- 
tain information is obtained on the properties of the item. This 
process proceeds in space and time and is an objective process. 
The interaction between the measuring instrument and the item 
being measured essentially differs for microscopic and macrosco- 
pic items. In the latter case, the measuring process is described 
with more or less accuracy by the laws of classical physics, and 
the instrument has no effect on the item being measured that 
cannot be accurately taken into account in the terms and con- 
cepts of classical physics or reduced to as small a value as de- 
sired. In quantum mechanics, owing to the objectively existing 
duality of the properties of microscopic items (38.1.3), the meas- 
uring process is inevitably associated with substantial effects 
of the instrument on the course of the phenomenon being investi- 
gated. For example, to determine the position of the electron it 
must be “illuminated” by light of the highest possible frequency. 
As a result of the collision between the photon and the electron, 
the momentum p,. of the latter is changed by an amount deter- 
mined by the indeterminacy relation: Ap, > h/2Az (38.6.2). This 
effect on the item being measured cannot be considered small 
or inessential because the state of the item is changed. This 
change is such that as a result of measurement, definite classical 
characteristics of a particle, for instance, its momentum, are 
found to be specified only within the limits restricted by the 
indeterminacy relations. 
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38.7 Tunnel Effect 


38.7.1 Potential wells in which particles are located may be of 
much more complicated shape than those dealt with in Sect. 38.4. 
If the potential energy of the particle is of the form U = U (r), 
as shown in Fig. 38.8, the particle moves in a field of forces that 


Umax 


Tmar 


Fig. 38.8 


can be represented in the form of a potential barrier. This means 
that the forces act on the particle in a certain limited region of 
space. Outside of this region the particles move as free ones. 
The height H of the barrier and its width a for a given particle 
depend upon the energy E of the particle. In order for a particle 
to get out of a ‘potential well or to enter it from outside, it is 
necessary, according to classical mechanics, to impart energy 
to the particle equal to or greater than | Unax — E | = H. 
38.7.2 The tunnel effect is the passing (“leaking”) of particles 
through potential barriers. The tunnel effect is a quantum-mecha- 
nical one and is due to the wavelike properties of the particles 
(38.1.3). 

The penetrability (penetrability factor) D of a potential barrier is 
the quantity 


= Ttran 


dine’ 
where tran is the intensity (30.3.6) of de Broglie waves passing 
through the potential barrier, and Jince is the intensity of the 
waves incident on the barrier, 
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The penetrability of a rectangular potential barrier of the height 
U, and width LZ (Fig. 38.9) is 


D=D, exp | -— = V 2m OBL |, 
where m is the mass of the particle, and Æ is its energy. 


For a potential barrier of complex shape 


x 


D=D, exp {—2 | Vim WEE a}, 


Xi 


where z, and z, are the coordinates of the points at the beginning 
and end of the potential barrier U (x) for a given value E of 


Fig. 38.9 Fig. 38.10 


the total energy (Fig. 38.10). In these formulas Dy is a constant 
factor close to unity. 

38.7.3 The tunnel effect plays an important role when the pene- 
trability is not too low. This occurs in the cases when the linear 
dimensions of the potential barrier are commensurable with 
atomic dimensions. For example, at Uy — E = 10 eV for an 
electron (m ~ 10- kg) and L = 10-1 m, we obtain D ~ 1/e. 
Under the same conditions, excepting that L = 10-2? m, D x 
~ exp (—108). The penetrability of the barrier decreases with 
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an increase in the mass of the particle and in the difference 
Uy = E. 

38.7.4 The paradox of the tunnel effect is that the passing of 
a particle through the potential barrier enables it to be found 
in a region that is forbidden from the classical point of view. 
In this region the potential energy of the particle exceeds its 
total energy; its kinetic energy is negative and, consequently, 
the velocity (momentum) of the particle becomes an imaginary 
quantity. 

Actually no such paradox exists. The tunnel effect is a purely 
quantum-mechanical phenomenon and here an unexpected 
difficulty arises, from the classical point of view, in represent- 
ing the total energy E in the form of the sum of the kinetic 
and potential energies: E = p?/2m -+ U (x). Owing to the fact 
that the indeterminacy relation (38.6.2) excludes the possibility 
of simultaneously specifying definite values, to any degree of 
accuracy, for the coordinate z and momentum p of a particle, 
it is found to be inadmissible in quantum mechanics to repre- 
sent the total energy as the sum of two exactly definite parts: 
the kinetic p?/2m and potential U (x) energies. If the coordina- 
te x of the particle is fixed in the region Az and its potential 
energy U (x) is determined with sufficient accuracy, then the 
indeterminacy Ap fntroduced into the value of the particle's 
momentum (Ap ~ Ā/Azx) excludes the possibility of specifying 
the exact value of the kinetic energy p?/2m of the particle. 
Consequently, the previous value of the particle’s total energy E 
cannot be specified either. It is found, in this case, that the change 
in the kinetic energy AE,, caused by fixing the particle's 
coordinate, exceeds the difference between the barrier height 
U, (Fig. 38.9) and the total energy E of the particle: 


AE, > U— E. 


Thus, AZ; exceeds the energy that the particle, inside the poten- 
tial well, lacks to surmount the barrier “in the classical way” 
to get out of the well. 

The role played by the tunnel effect in solid-state physics and 
in the phenomenon of radioactive decay is dealt with in Sects. 
44.8.3 and 42.5.2. 
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CHAPTER 39 STRUCTURE OF ATOMS 
AND MOLECULES AND THEIR 
OPTICAL PROPERTIES 


39.1 The Hydrogen Atom and Hydrogen-Like lons 


39.1.1 An atom is the smallest particle of matter possessing all 
the chemical properties of the given chemical element. An atom 
consists of a positively charged nucleus and electrons moving 
about in the electric field of the nucleus. The nuclear charge Ze 
(42.1.2) is equal in magnitude to the total charge of all the elec- 
trons of the atom. An ion is the electrically charged particle 
formed when an atom or molecule loses or gains electrons. 
39.1.2 The simplest atomic system is the hydrogen atom, 
consisting of a single proton in the nucleus and a single electron 
moving in the Coulomb feld of the nucleus. Hydrogen-like ions 
(isoelectronic with hydrogen) are such as Het, Lit+* and Bett, 
which also have a nucleus with the charge Ze and a single elec- 
tron. 

39.1.3 Among the optical properties of atoms, the most impor- 
tant is their emission spectrum. The frequency v of the lines 
in the discrete line spectrum of hydrogen is described by the 
Balmer-Rydberg formula: 


v=cR’ (4) =R (eN 


n? m2 \ nè? m? 
where 
4 
me : . 
R= Boone (in SI units), 
2m*me* : ; 
a (in Gaussian units). 


Here c is the velocity of light in free space, m is the mass of the 
electron, e is the electron charge, h is Planck’s constant, and 
& is the electric constant in SI units (14.2.7). 

Quantities R and R' = R/c are called the Rydberg constant in 
s7! and in cm~ or m, respectively: R == 3.2931193 x 1015 s-1 
and R’ = 1.0973731 Xx 107 m^, 
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The whole numbers n and m are called the principal quantum 
numbers, and m = n -+ 1, n+ 2, etc. A group of lines having 
the same n value is called a series. The hydrogen spectrum has 
(he following series: Lyman series at n = 1, Balmer series at 
n = 2, Paschen series at n = 3, Brackett series for n = 4, Pfund 
series for n = 5 and the Humphreys series for n = 6. 

l'or hydrogen-like ions (39.1.1) the Balmer-Rydberg formula is 


1 4 
ve mR (aT) 
where Z is the atomic number of the element in the Mendeleev 
periodic table (39.3.5). 
39.1.4 Each value of the quantum number n in the Balmer- 
Rydberg formula corresponds to the series limit with the maxi- 
mum frequency (at m = oo), called the term Tn. Thus 


R Z2? R TRAR 
| ae a (for hydrogen), Tn = a (for hydrogen-like ions). 


39.1.5 The energy of a hydrogen-like ion in a state with the 
principal quantum number n is: 
Z2 Rh | En | 


n2 E] f= h . 


The magnitude Æn is called the binding energy of the electron 
in the atom. The minimum value Æ, (at n = 1) corresponds 
to the ground, or normal, state of the atom (39.1.9). All energy 
values at n > 1 characterize excited states of the atom. The most 
vital distinguishing feature of excited states is the finite life- 
time t of the electron in these states: t ~ 10-8 s. In the ground 
state of the atom, isolated from all external effects, t is unlimi- 
ted. The maximum energy value Emax = 0 as n — oo corre- 
sponds to ionization of the atom or ion, j.e. to the removal of an 
electron from it. 

In magnitude, the ionization energy is equal to the binding 
energy of an electron in the atom (or ion). 

The ionization potential (16.2.6) of an atom of hydrogen or of 
a hydrogen-like ion in the state with the principal quantum 
number n is equal to V = Z?Rh/en?, where e is the magnitude of 
the charge of the electron. A diagram of the energy levels in the 
hydrogen atom and the arrangement of its series of spectral 
lines are given in Fig. 39.1, | 
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39.1.6 The spectrum and the energy levels of the hydrogen 
atom were first interpreted by means of Bohr’s postulates. 
Bohr’s first postulate (steady states postulate) states that there are 
a set of stationary, or steady, states of the atom in which it 
does not radiate electromagnetic waves. The steady states cor- 
respond to steady orbits (39.1.8) along which electrons travel 
with acceleration, but no light is radiated (cf. Sect. 31.3.4). 
Bohr’s first postulate was explained by quantum mechanics 
(38.2.6). 

Bohr’s second postulate (orbit quantization rule) states that in 
the steady state of the atom, the electron, travelling along a 
circular orbit, has quantized values of its angular momentum 
that comply with the condition 


Ly = mor = kì (k= 1, 2, 3, ...). 


llere m is the mass of the electron, v is its velocity, r is the ra- 
dius of the kth orbit, and % = h/2nx. The whole number k is 
equal to the number of de Broglie wavelengths for the electron 
(38.1.4) per length of the circular orbit: 2nr/A = 2armv/h = k. 
The quantization of the angular momentum in quantum mecha- 
nics is discussed in Sect. 39.1.8. 

Bohr’s third postuldte (the Bohr frequency condition) states that 
in going from one steady state orbit to another, the atom either 
emits or absorbs one photon. A photon is emitted when the atom 
passes from a state with more energy to one with less energy. 
Upon transition in the opposite direction, a photon is absorbed. 
The energy hv of the photon is equal to the difference in energy 
in the two states. Thus 


En — Em = bw. 


When En > Em a photon is emitted; when En < Em a photon 
is absorbed. 

In quantum mechanics, Bohr’s frequency condition follows from 
the theory of quantum transitions from one energy state to 
another. Information on this theory is beyond the scope of the 
present handbook. 

39.1.7 Schrédinger’s steady-state equation (38.2.5) for the 
motion of an electron in the Coulomb field of a nucleus with the 
charge Ze (42.1.2) is of the form 


Ap - at (EU) p==0, 
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where U (r) = —Ze?/A4neor is the potential energy of an electron 
located at the distance r from the nucleus, £ is the electric con- 
stant in SI units (14.2.7), and Æ is the total energy of the elec- 
tron in the atom, which it is necessary to determine under the 
assumption that the wave functions comply with the condi- 
tions indicated in Sect. 38.2.4. 

39.1.8 A solution of the Schrédinger steady-state equation for 
an electron in a centrally symmetrical Coulomb field of the 
nucleus yields the following: 

(a) the angular momentum of the electron in the atom is quan- 
-tized according to the formula: 


Lı =V I(LF1)h, 


where the orbital quantum number l, determining the angular 
momentum, varies within the limits: 17 = 0, 1, ..., (n — 1), 
where n is the principal quantum number (39.1.3); 

(b) at E < 0, when the electron is “bound” in the atom, its mo- 
tion is periodic, and the value of the total energy E is quan- 
tized. The eigenvalues FE, (38.2.5) are determined by the formula: 


Z*me* 1 ZZRh : 
| Snes nm? n2 (in SI units), 
272met 2 
En = a ee att (in Gaussian units). 
h2 n? 


Here R is the Rydberg constant (39.1.3). The solution of the 
Schrödinger equation for an electron in a hydrogen-like ion 
leads to energy levels of the Balmer-Rydberg type (39.1.4). 
39.1.9 In accordance with the orbital quantum number, the 
following letters have been assigned to designate the state of. 
the electron in atoms: 

s-state at l = 0, p-state at 1 = 1, 

d-state at l = 2, f-state at l = 3, etc. 

The s-state of the electron in the hydrogen atom at n = 1 is 
called the ground state (see also Sect. 39.1.5). This state is 
spherically symmetrical. The wave function of this state de- 
pends only upon the distance r of the electron from the nucleus 
[p = 1p (r)] and is of the form 


p=y (r)= Ce", 
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where C is a constant determined from the normalization con- 
dition for probabilities (38.2.2), and a, is the first Bohr radius: 


ANEJA? 


n (in SI units), 


ag = 


| ar bi ; 
dy =—z (in Gaussian units). 
me 


In quantum mechanics, the electron orbits in the atom are con- 
sidered to be the locus of points at which the probability of 
finding the electron is a maximum. In particular, for the s-state 
of the hydrogen atom, such an orbit is the first circular Bohr 
orbit of a radius equal to ap. 


39.2 Space Quantization 


39.2.4 Space quantization is the existence of definite discrete 

orientations in space of the angular momentum vector L; of 

the electron. This phenomenon has been proven in quantum me- 

chanics. The only allowed orientations of L; are those in which 

the projection Lı, of vector L; along the direction Z of the ex- 

cone magnetic, field assumes values that are multiples of i. 
us 


Liz = mh, 


where m is a whole number called the magnetic quantum number 
and assumes the values m = 0, +1, +2, ..., +l, and z is 
the orbital quantum number (39.1.8). Given in Fig. 39.2 is the 
space quantization of vectors L; for electrons in the p- and 
d-states (39.1.9). 

39.2.2 Since the angular momentum vector of the electron is 
proportional to the orbital magnetic moment of the electron 
(26.1.2), it follows from space quantization that the vector 
Pm of the orbital magnetic moment of the electron (26.1.2) in 
the external magnetic field cannot assume arbitrary orienta- 


tions. 
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39.2.3 It was experimentally established that space quantiza- 
tion is observed in atoms having a single outer-shell (valence) 
electron in the s-state (where J = 0) (39.1.9). In this state the 

atoms lack angular momen- 


ah tum (Ly = 0; see Sect. 
Ve + 39.1.8). Space quantization, 
detected in such experi- 
R ments, pertained to electron 
a 0 spin, and confirmed the exist- 
ence of two possible spin 
4 _4 orientations in the external 
magnetic field (26.1.3). 
“2h 


p- stale The magnitude of the spin 
angular momentum Ly, of 

d- state the electron is determined 
by the formula (39.1.8): 


Lis=V s(s+1)h, 


where s is the spin quantum number, equal to s = 1/2. Therefore, 
the numerical value of electron spin equals (26.1.3): 


nV Fre 


Space quantization of spin means that the projection Lisz 
of the spin vector L; on the direction of the external magnetic 
field is determined by the formula 


Fig. 39.2 


Lisz = msh, 


where ms is the spin magnetic quantum number, differing from 
the spin quantum number s in that it can assume two values: 
not only +1/2, but —1/2 as well*. 


* Frequently, no distinction is made between the two spin 
quantum numbers s and m,, referring, in this case, to m, to 
which two values, +1/2, are assigned. 
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39.3 Pauli Exclusion Principle. Mendeleev’s 
Periodic Table 


39.3.1 The simplest statement of the Pauli exclusion principle 
is: in any atom there cannot be two electrons occupying two 
identical steady states, which are specified by a set of four quan- 
tum numbers: the principal n, orbital 1, magnetic m and spin 
magnetic m, quantum numbers. 

Also obeying the Pauli principle, in addition to electrons, are 
other particles having half-integral spin (in # units). In any 
system of fermions (41.2.4) no two particles can exist in identical 
quantum states (41.1.4). 

For the electrons in an atom, the Pauli principle is written as 
follows: 


Z, (n, l, m, ms) = O or 14, 


where Z, (n, 1, m, ms) is the number of electrons in the state 
specified by the given set of quantum numbers. 

39.3.2 The maximum number Z, (n, l, m) of electrons occu- 
pying states specified by a set of three quantum numbers, n, l 
and m, and differing only in the orientations of the spins of the 
electrons, is . 


Zo (n, l, m) m 2, 


because the spin magnetic quantum number m, can assume only 
two values: 1/2 and —1/2. 

39.3.3 The maximum number Z, (n, l) of electrons occupying 
states specified by two quantum numbers, n and J, is 


Z, (n, D) = 2 (21 + 1). 


This is based on the fact that for a given l value vector L; can 
assume (2l + 1) different orientations in space. 

39.3.4 The maximum number of Z (n) electrons occupying states 
specified by the value n of the principal quantum number is 


i=n-1 


Z(n)= >) Zg(n, l)=2n?. 
1=0 


Table 39.4 lists the maximum number of electrons occupying 
states characterized by the given values of the principal n and 
orbital 1 quantum numbers. 


36—0186 
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TABLE 39.1 
Number of electrons occupying the states Wasimi 
n| Shell number of 
s (l = 0) p= vfau=2[su=3) g (l= 4)| electrons 
1) K 2 — — — — 2 
2| L 2 6 — — — 8 
3), M 2 6 10 — — 18 
4) N 2 6 10 14 — 32 
oj O 2 6 10 14 18 50 


39.3.5 The systematics for filling the electron states in atoms and 
the periodicity with which changes occur in the properties of 
chemical elements enabled them all to be arranged into Mende- 
leev’s periodic system of elements. Modern theory of the periodic 
system is based on the following principles: 

(a) The atomic number Z of a chemical element is equal to the 
total number of electrons in an atom of the given element. 

(b) The state of the electrons in an atom is determined by a set 
of four quantum numbers: n, l, m and ms. The distribution of 
the electrons according to the energy states should comply with 
the principle of minimum potential energy: as the number of 
electrons is increased, each consecutive electron should occupy 
the energy state with the least possible energy. 

(c) The filling of the energy states in the atom by electrons should 
proceed in accordance with the Pauli principle (39.3.1). 
39.3.6 An electron shell is a collection of all the electron states 
in an atom, which have the same value of the principal quantum 
number n (39.1.3). Distinction is made between the following 
electron shells (Table 39.1): K at n = 1, Lat n = 2, M at n = 3, 
N at n= 4, etc. 

Inside the electron shell, the electrons are distributed among 
the subshells, each of which corresponds to a certain value of the 
orbital quantum number l (39.1.8). | 
39.3.7 The order for the filling of the electron energy states in an 
atom in the shells and, within one shell, in the subshells, should 
correspond to the sequence of the energy levels with the given n 
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and Z values, and comply with the Pauli principle (39.3.5b). For 
light-weight atoms, this order consists of first filling the shell 
with the lower value of n, and only then filling the next shell with 
electrons. Inside a given shell, first the state with 7 = 0 is filled, 
then the states with larger and larger J values up to 1 = n — 1. 
Such an ideal periodic system of elements would have a structure 
and number of elements in each period (lengths of the periods) 
corresponding to Table 39.4. 

39.3.8 Deviations from the order indicated in Sect. 39.3.7 begin 
with potassium (Z = 19) and are explained as follows. The inter- 
action between electrons in the atom leads, at sufficiently large 
principal quantum numbers n, to a situation in which states with 
larger n and lesser / values can have less energy, i.e. be more favour- 
able from the energy viewpoint than states with less n, but with 
higher 7 values. As a result we have elements with incompletely 
filled preceding shells, but in which subsequent shells are being 
filled. Chemical elements, in which the filling of preceding shells 
is completed after the partial filling of subsequent shells, are called 
transition elements. 

39.3.9 The outer (valence) electrons of an atom are those in the 
shell with the highest n value that belong to the s- and p-sub- 
shells, i.e. they have Z = 0 or l= 1 (39.1.9). These electrons 
determine the chemical and optical properties of the atoms (see 
also Sect. 20.3.1). The total number of electrons in the s- and p- 
subshells is equal to 8 (Table 39.1). Most chemical reactions are 
based on giving up or attaching outer (valence) electrons. It 
proves expedient; theoretically, for an atom, with the (s+ p)- 
subshell less than half-filled to give up valence electrons, and for 
one with a more than half-filled (s -+ p)-subshell toattach more 
electrons from other atoms in the process of chemical reactions. 


39.4 Chemical Bonds and Molecular Structure 


39.4.1 A molecule is the smallest particle of a given substance, 
possessing its principal chemical properties. Molecules consist of 
either the same or different atoms (39.1.1), joined together by 
interatomic chemical bonds. Chemical bonds are based on various 
interactions of the outer (valence) electrons of the atoms (39.3.9). 
‘This is indicated by the changes in the optical spectrum, which 
explains the behaviour of these electrons when molecules are form- 
ed of atoms. 


36% 
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A definite amount of work must be done to dissociate molecules 
into their constituent atoms. On the contrary, the formation of 
molecules is accompanied by the evolution of energy. This shows 
that there are forces which bind atoms into molecules. The energy 
evolved in forming molecules is a measure of the forces of inter- 
action that join atoms into molecules. 
39.4.2 At large distances r between atoms, distances that consid- 
erably exceed the linear dimensions d of the atoms, forces of 
mutual attraction exist between them. At distances r comparable 
to d, the forces become ones.of mutual repulsion, which do not 
permit the electrons of a 
Ep [r) given atom to penetrate too 
deeply into the electron shells 
of an adjacent atom. As in 
the case of interaction be- 
tween molecules (12.1.4), the 
repulsive forces between atoms 
are more short-ranged, than 
those of attraction (Fig. 12.1). 
The signs of the forces of at- 
traction and repulsion are 
discussed in Sect. 12.1.4. 
39.4.3 At a certain distance 
rọ between the atoms of a 
diatomic molecule, the op- 
positely {directed forces of attraction and repulsion counter- 
balance each other and their vector sum equals zero. The 
distance rọ corresponds to the minimum mutual (potential) 
energy Ey (r) of the atoms of a diatomic molecule (cf. Sect. 12.1.6) 
(Fig. 39.3). The equilibrium interatomic distance rọ in a molecule 
is called the bond length, or distance. The quantity D is called the 
dissociation, or bond, energy. It is numerically equal to the work 
that must be done to dissociate a molecule into its constituent 
atoms and to separate them to infinite distances from one another. 
The dissociation energy is equal to the energy evolved in forming 
a molecule, but is of opposite sign. Dissociation energy is negative, 
whereas the energy evolved in forming molecules is positive. 
39.4.4 Heteropolar (ionic) molecules are ones formed as the result 
of the conversion of interacting atoms into oppositely electrically 
charged and. mutually attracting ions (39.1.1). This kind of bond 
between the atoms of a molecule is called a heteropolar (or ionic 
or electrovalent) bond. The formation of heteropolar molecules is 


Fig. 39.3 
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based on the behaviour of atoms with differently filled outer 
(s + p)-subshells, discussed in Sect. 39.3.9. Typical heteropolar 
(ionic) molecules are those of the alkali-halide salts, formed by 
ions from the elements of the I and VII groups of the periodic 
system (39.3.5). These include: NaCl (Na+Cl-), CsI (Cs+I-) and 
others. Metals of the first group have low ionization potentials V 
(39.1.5), whereas atoms of group VII are characterized by a high 
electron affinity, which is the amount of energy released when an 
electron is attached to a metalloid atom. The transition of an 
electron from a metal atom to a metalloid atom leads to the forma- 
tion of ions, each of which has a stable eight-electron (s + p)- 
subshell (39.3.9). As a result, a stable heteropolar (ionic) molecule 
is formed. 

39.4.5 Homopolar (atomic) molecules are ones formed by the mu- 
tual attraction of neutral atoms. The chemical bond of the atoms 
of a homopolar molecule is said to be covalent. A covalent bond 
has the property of being saturated (saturation of a covalent 
bond), which is manifested as the definite valency ofthe atoms. 
An atom of hydrogen can be bound to only one other atom, and 
an atom of carbon to no more that four other atoms. The simplest 
molecule with a covalent bond is the hydrogen molecule H,; 
it consists of two electrons and two nuclei—protons. 

The basis of the quantum-mechanical explanation of the covalent 
bond in the H, molecule is the indistinguishability of identical 
particles, e.g. the electrons in a hydrogen molecule (see also 
Sect. 41.1.2). The electrons in the H, molecule, each of which 
“belongs” to a definite nucleus, can be interchanged and this still 
leads to the same system: an H, molecule consisting of two elec- 
trons and two nuclei. The indistinguishability of electrons leads 
to the occurrence of a special quantum-mechanical exchange interac- 
tion between two identical electrons. This interaction consists 
in the electron of each of the atoms in the hydrogen molecule 
spending a certain part of its time at the nucleus of the other atom. 
This constitutes the bond between the two atoms forming the 
molecule. It follows from quantum-mechanical calculations that 
when two hydrogen atoms approach each other to a distance 
commensurable with the Bohr radius (39.1.9), and under the 
condition that the spins of the electrons in the atoms are anti- 
parallel (26.1.3), the two atoms attract each other and a stable 
H, molecule is formed. With parallel spins of the electrons, the 
two atoms of hydrogen repel each other, and no molecule of hydro- 
gen is formed. 
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39.5 Optical Properties of Molecules. 
Molecular Spectra 


39.5.1 Owing to their specific appearance, the spectra of molecules, 
or molecular spectra, are called band spectra. They consist of a set 
of more or less wide bands formed of closely spaced spectral lines. 
Bands are observed in molecular spectra in the infrared, visible 
and ultraviolet ranges of the scale of electromagnetic waves, 
(31.4.3). Closely spaced bands form a group of bands. Several 
groups of bands are observed for the simplest diatomic molecules. 
Continuous broad emission (or absorption) bands are observed in 
the visible and ultraviolet regions of the spectra of polyatomic 
complex molecules. 

39.5.2 As in the spectra of atoms (39.1.3), a separate spectral 
line in a molecular spectrum appears as a result of a change in the 
energy of a molecule. The total energy Z™ of a molecule can be 
presented as the sum of the following parts, assumed independent 
in a first approximation: 


EM — EM. AM) AM AM pM 


where £;" is the energy of translational motion of the centre of 
mass (2.3.3) of the molecule, FẸ is the energy of motion of the 


electrons in the molecule’s atoms, E\? is the energy of vibrational 
motion about their equilibrium positions of the nuclei in the mole- 
cule’s atoms, E>" is the energy of rotary motion of the molecule as 


a whole, and Fp is the energy of the nuclei in the molecule’s 


atoms. The energy Æ} is not quantized (38.4.3) and its-change 
cannot lead to the production of a molecular spectrum; the effect 
of #,’ on the molecular spectrum can be neglected in a first ap- 


proximation. The energy Z’™ of the molecule, whose change gov- 
erns the molecular spectrum, constitutes the sum of three com- 
ponents: 


vm pm; pm pm 
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39.5.3 According to the Bohr frequency condition (39.1.6), the 
frequency v of the photon emitted by the molecule when its energy 
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state changes is equal to 


Agm AE™ AEM AE™ 
Vag a ee S 


where AE’, AE} and AE™ are the changes in the corresponding 
parts of the energy E’™, which assume discrete quantized values. 
The production of closely spaced lines, forming bands at various 
parts of the spectrum, is explained by the fact that 


AEM AE? AEP, 


In the far-infrared region of the spectrum (wavelengths of the 
order of 0.4 to 1 mm), the transition of the molecule from one rota- 
tional energy level to another produces spectral lines of a rotation 
spectrum. 

39.5.4 In the infrared region of the spectrum (wavelengths from 
one to several dozen micrometres), transitions of molecules be- 
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Fig. 39.4 


tween vibrational energy levels produce the lines of the vibrational 
spectrum of the molecule. Upon changes in the vibrational energy 
levels of the molecule, its rotational energy levels are also changed 
at the same time. Hence, transitions of molecules between vibra- 
tional levels are vibrational-rotational transitions, producing 
a vibrational-rotational spectrum with the frequencies vyr 
(Fig. 39.4). This spectrum consists of groups of closely spaced lines 
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determined by the fact that the given vibrational transition is 
accompanied by rotational transitions. 

39.5.5 The visible and ultraviolet regions of the spectrum are 
produced as a result of transitions of the molecule between the 
various electron energy levels. Each electron energy level corre- 
sponds to various possible vibrations of the nucleus in the molecule, 
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Fig. 39.5 


i.e. to a set of vibrational energy levels. Transitions between 
electron-vibrational levels lead to the production of an electron- 
vibrational spectrum of the molecule, characterized by the frequen- 
cies Vey of the various lines (Fig. 39.5). Since each vibrational 
energy state corresponds to a system of rotational levels (39.5.4), 
each electron-vibrational transition corresponds to a_ certain 
band. The whole electron-vibrational spectrum of the molecule in 
the visible and nearby regions consists of a system of several 
groups of bands. 


39.6 Raman Scaftering of Light * 


39.6.1 Raman scattering of light, or the Raman effect, refers to 
the shifted frequencies vg and v, observed, along with the frequen- 
cy vo of the source of radiation, in the spectrum of light scattered 
by a solid or liquid body. A spectral line with the frequency 
Vs = Vo — v is called a Stokes (red) line; one with the frequency 
Va = Vo + v is called an anti-Stokes (violet) line. The set of lines 


* Commonly called combination scattering in the Soviet lite- 
rature. 
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with the frequencies v and v, form the Raman spectrum of the 
molecule, containing red and violet satellites with the frequency vo. 
The intensity of the violet satellites is less than that of the red 
ones, but increases with the temperature. The intensity of the 
red satellites is practically independent of the temperature. 

39.6.2 The quantum-mechanical explanation of the Raman scat- 
tering phenomenon consists in an analysis of the interaction be- 
tween a photon of incident light having the frequency v, and a 
molecule at the ground vibrational energy level with the ener- 
gy ET,- If, as a result of the interaction with the photon, the 
molecule is moved to higher vibrational energy level with the 


energy Es, > Ey), theenergy AE™ = E5, — Ey required for 
this purpose is borrowed from the incident light. A photon with 
the energy hv, is absorbed and, in its place, a photon is produced 
with lower frequency v and an energy equal to hv = hv, — AE™, 


At this the frequency v appears in the scattered light. It equals 
AE™ 


ME OS a 

and corresponds to the frequency of a red satellite. The whole set 
of red satellites are produced in transitions of the molecule to 
various excited vibrational energy states. 

39.6.3 If a molecule at the vibrational energy level with the ener- 
gy E5, (39.6.2) moves, due to interaction with a photon having 
the energy hv, to an energy state with less energy (EM) a violet 
satellite of the frequency v, is produced in the scattered light 
spectrum. This frequency 


AF™ 
Va=Vot ji 


where AEM = E5, — EXD. 

The probability of Raman scattering with an increase in frequency 
is less than the same scattering with a reduction in frequency. 
This is due to the fact that the number of molecules at the higher 
energy levels is less than the number at the lower levels. Conse- 
quently, the intensity of violet satellites is less than that of the 
red ones. When the temperature is raised, the molecule popula- 


\ 
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tion of the upper energy levels increases, as does the intensity of 
the violet satellites. The number of molecules in the ground 
state changes only slightly upon heating. Therefore, the intensity 
of the red satellites, remains practically unchanged as the tem- 
perature is raised. 


39.7 Luminescence. X rays 


39.7.1 Luminescence is the radiation of light by bodies at a tem- 
perature T, which is over and above the thermal radiation (36.1.1) 
at this temperature. The duration of luminescence substantially 
exceeds the periods of radiation of atomic systems. Depending 
upon the method applied to excite the luminescent glow, distinc- 
tion is made between cathodoluminescence, electroluminescence, 
photoluminescence and chemiluminescence. Luminescent sub- 
stances are called luminophors. 

39.7.2 Luminescent radiation is a nonequilibrium one (cf. 
Sect. 36.1.2) and is caused by a comparatively small number of 
luminescence centres—atoms, molecules or ions—put into an excit- 
ed state by the source of luminescence. The return of the excited 
centre to its ground or less excited state is accompanied by lumi- 
nescent radiation. The duration of this radiation is determined by 
the duration of the excited state, which depends on the properties 
of not only the radiating centres, but of the medium surrounding 
them as well. The duration of a metastable excited state can reach 
10-4 s, thereby correspondingly increasing the duration of the 
luminescence. 

39.7.3 Luminescence that immediately ceases together with the 
action of the glow exciting source is called fluorescence. 
Luminescence that persists long after the action of the glow excit- 
ing source has ceased to act is called phosphorescence. 

The phenomenon of fluorescence is associated with the transitions 
of atoms, molecules or ions from ordinary excited states with 
a duration of the order of 10-8 s to the ground state. Phosphores- 
cence, which provides for prolonged glow, is due to the transition 
of luminescence centres from metastable states to the ground state 
(39.7.2). The subdivision of luminescence into fluorescence and 
phosphorescence is arbitrary; it is sometimes difficult to establish 
a time-based boundary between them. 

39.7.4 The excitation of luminescence by means of an electron 
beam is feasible provided that the kinetic energy of a bombarding 
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electron satisfies the inequality 


mv? 


2 


= Ee— Eg, 


where Ee and Eg are the total energies of the luminescent particle 
in the excited and ground states. 

39.7.5 Photoluminescence is excited by electromagnetic radiation 
in the visible or ultraviolet ranges and complies with the Stokes 
law: the wavelength of photoluminescence is usually longer than 
that of the exciting light. The quantum-mechanical basis for the 
Stokes law is that in the absorption of a photon of exciting light 
with the energy hv, a photon with the energy hvjym, less than hv, 
is produced. The surplus energy hv — hvjym = E™, where E™ is 
energy expended on various processes except photoluminescence. 
As a rule, EM > 0 and vjym < v, i.e. Ajum > A in accordance 
with the Stokes law. Sometimes, anti-Stokes luminescent radiation 
is observed. It complies with a condition that is the opposite of 
the Stokes law, i.e. Ajum < à. This occurs when a definite part 
of the energy of thermal motion of the particles of the lumines- 
cent substance is added to the energy hv of the photon that excites 
the luminescence. Thus 


hVium ae hv + akT, 


where a is a factor depending upon the nature of the luminescent 
substance, k is Boltzmann’s constant (8.4.5), and T is the absolute 
temperature. 

39.7.6 Photoluminescent energy efficiency is the ratio of the energy 
of luminescent radiation to the energy from the source absorbed 
under steady-state conditions by the luminophor. 

The quantum yield of photoluminescence is the ratio of the number 
of photons of photoluminescent radiation to the number of photons 
of the exciting light, the latter having constant energy. Vavilov’s 
-law states that the photoluminescent energy efficiency increases 
proportionally to the wavelength À of the absorbed radiation. Then, 
after reaching a maximum valuc in the wavelength range from A 
to Amax, drops rapidly to zero upon a further increase in wave- 
length. With an increase inthe wavelength of the exciting light 
of given energy, there is an increase in the number of photons 
contained in the given energy of primary radiation. Each of 
these photons can produce a photon of the energy hyljum. There- 
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fore, the photoluminescent energy efficiency increases with the 
wavelength 4. The sharp drop in energy efficiency at à = Amay 
indicates that photons with the frequency less than vmin = ¢/Amax 
cannot excite the particles of the luminophor. 

39.7.7 X rays are electromagnetic waves with a wavelength from 


0.01 to 800 A that are produced by braking high-speed electrons 
by a substance. There are two types of short-wave X rays. 

At electron energies, not exceeding a certain critical value that 
depends upon the substance in which the electrons are braked, 
white, or continuous, X rays are produced. It is radiated by braked 
electrons and has a continuous spectrum, bounded on the short- 
wavelength side by a certain boundary Appin, called the contin- 
uous-spectrum boundary. The boundary wavelength Amin de- 
pends upon the kinetic energy Ex of the electrons that produce the 
bremsstrahlung and is reduced with an increase in the kinetic 
energy Eg. The existence of Amin is explained by the fact that the 
maximum energy hvmax Of a photon of X rays, produced at the 
expense of the energy of the electron, cannot exceed Eg. Thus 


hvmax = Ex. 


Consequently, Amin = ¢/Vmax = ch/E,. Making use of data on 
the values of Ex and Amin, this formula enables Planck’s constant 
(Appendix II) to be determined. In its time this method was one 
of the most accurate and reliable ones. 

39.7.8 The second type of X rays, characteristic X radiation of 
atoms of matter, has a line spectrum. It is an individual character- 
istic of a substance and does not change when the substance enters 
a chemical compound. It follows that, contrary to optical spectra, 
characteristic X rays are associated with processes in the deep- 
lying filled inner electron shells of atoms (39.3.6), which undergo 
no changes in chemical reactions of the atoms. Line X-ray spectra 
consist of lines composing several series. The same types of line 
series are observed in various elements, differing only in that the 
like series of lines in the heavier elements are shifted toward the 
side of shorter wavelengths. 

39.7.9 In the order of increasing wavelengths, the series of char- 
acteristic X rays are called the K-, L- M-, and N-series. As an 
electron is removed from one of the inner shells of an atom with 
the nuclear charge Ze, the vacant space is occupied by an electron 
from a shell farther from the nucleus and an X-ray photon is 
emitted. Thus, after removing an electron from the K-shell, the 
filling of the vacancy by electrons from L-, M- and other shells, 
produces the K,-, Kg- and K,-lines that form the K-series, 
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‘The frequency v of the lines of characteristic X rays can be found 
by Mosley’s law: 


V v=a(Z—D), 


where a is a constant for the given series of lines (in s-1/), and b 
is the screening constant. This constant means that an elec- 
tron, accomplishing a transition corresponding to a certain line, 
is subject, not to the whole charge Ze of the nucleus, but to the 
charge (Z — b) e, which has been weakened by the screening action 
of other electrons. 


39.8 Stimulated Emission of Radiation. Lasers 


39.8.4 An atom located in an electromagnetic field at an excited 
energy level can, with a certain probability, be transferred by 
the effect of the field to a lower energy state. It is as if the electro- 
magnetic field “tumbles” the atom down from its excited level 
to the ground state or only toa less excited state. Such a transi- 
tion is accompanied by stimulated (induced) radiation of the sub- 
stance, produced by the action of an electromagnetic wave on it. 
39.8.2 From the viewpoint of wave optics (32.1.1) the phenomenon 
of stimulated emission of radiation means that as the electromag- 
netic wave passes through a substance its intensity (31.2.4) is 
increased, i.e. negative absorption of light occurs. But the frequency 
ER of the wave, direction of its propagation, its phase (30.2.5) 
and polarization (31.1.7) remain unchanged. Stimulated emissio n 
of radiation is strictly coherent (30.5.1) to the electromagnetic 
wave passing through the substance and producing the emis- 
sion. 

39.8.3 From the quantum-mechanical point of view, the coherence 
mentioned in Sect. 39.8.2 means that the new photon, produced 
as a result of the stimulated emission, in no way differs from the 
photon it was produced by. The new photon, produced in stimulat- 
ed emission, strengthens the light passing through the medium. 
As is shown in Fig. 39.6b, the process of stimulated emission leads 
to the production of two photons with the energy hv in place of 
one such photon. But, in addition to stimulated emission, ab- 
sorption of light also occurs. As a result of the absorption of a pho- 
ton by an atom at the energy level £,, the photon disappears and 
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the atom moves to the energy level E, (Fig. 39.6a). This process 
reduces the intensity of the light that passes through the sub- 
stance. 

39.8.4 A medium is said to be active if in it the intensity of 
transmitted light increases. This indicates that in an active medi- 
um the process of stimulated emission predominates over the 
process of light absorption. Otherwise, when the chief role is played 
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by the absorption of light, the medium weakens the radiation 
passing through it. An active medium is also called one with 
negative absorption (39.8.1). 

39.8.5 The absorption of light in matter proceeds in accordance 


with the Bouguer-Lambert law: I = I,e~*”, where œ > 0 is the 
absorption factor, zx is the thickness of the absorbing layer, J, is 
the intensity (31.2.4) of light at the entrance to the layer (at z = 
= 0), and J is the intensity of the light that has passed through 
the layer of thickness x. The Bouguer-Lambert-Fabrikant law is 
valid for a medium with negative absorption. Thus 


I = ILe”, 


where |&œ |>0 is a positive quantity corresponding to the strength- 
ening, rather than the weakening, of light passing through an 
active medium. At this the intensity of the light increases drasti- 
cally with the thickness of the layer of the medium (Fig. 39.7). 
In other words, the absorption factor a is a negative quantity 
for an active medium. 
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39.8.6 Three types of optical processes, illustrated in Fig. 39.8 
can occur between two energy levels with the energies £; and Eo. 
Under conditions in which spontaneous emission can be neglected, 
i.e. radiation in which the excited atoms (or molecules or ions) 
spontaneously drop to the ground state, the optical properties of 
a medium are determined by competition between two processes: 
absorption and stimulated emission. The number of events of 
absorption is proportional to the concentration N, of particles 
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with the energy E, at the lower energy level. The number of events 
of stimulated emission is proportional to the concentration N, of 
particles at the upper energy level. The absorption factor œ in the 
Bouguer-Lambert-Fabrikant law (39.8.5) is proportional to the 
difference between the numbers of events of absorption and 
stimulated emission. Thus 


a = k (Ny — No), 


where k > 0 is the proportionality factor. 

39.8.7 When the system is in a state of thermodynamic equilib- 
rium (8.3.3), Na < Nı and a > 0. This means that the number of 
events of absorption usually exceeds the number of transitions 
accompanied by stimulated emission of radiation. To obtain 
a medium with a negative absorption factor it is necessary to 
provide for a nonequilibrium state in the system, at which N, > 
> N,. Such states are said to be inverted. In such a state the 
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number of events of stimulated emission of radiation exceeds the 
number of events of absorption. 

39.8.8 Lasers (from the acronym for “Light Amplification by the 
Stimulated Emission of Radiation”), or generators of coherent 
light, are sources of light operating on the principle of stimulated 
emission of radiation in an active medium with an inverted 
population of energy levels (39.8.7). If these devices operate in 
the optical range, they are known as lasers proper. Those that 
operate in the range of ultrashort radiowaves are called masers 
(Microwave Amplification by Stimulated Emission of Radia- 
tion). 

39.8.9 The process of obtaining an inverted state (population 
inversion) in a medium (39.8.7), required for laser operation, is 
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called pumping the active medium. In practice, pumping is accom- 
plished by the three-level procedure. One of the first lasers operating 
according to a three-level pumping procedure had aruby for the 
active medium (alumina Al,O, with the addition of chromium 
oxide Cr,O3). In the crystal lattice of alumina, a part of the Al 
atoms are replaced by Cr+ ions, which serve as the active sub- 
stance accomplishing the transitions with stimulated emission. 
A schematic diagram of the energy levels of Cr+ is given in 
Fig. 39.9. The nearest to the ground-state level C are two broad 
energy bands A (41.8.2) and the double metastable level B. Inten- 
sive radiation, applied to the ruby by green light of a powerful 
pumping flashtube, filled with neon and krypton, transfers the 
chromium ions to the level of the A band. From here radiationless 
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transitions occur to the B level. The surplus energy is imparted 
to the ruby crystal lattice. As a result, a population inversion 
of the chromium ions at the B and C levels is obtained (39.8. 7) 
and the laser emits red light of the wavelengths 6927 A and 6943 A, 
corresponding to chromium ion transitions from level B to lev- 
el C. 

39.8.10 The avalanche-type increase in intensity in an active 
medium (39.8.5) indicates that such a medium operates as an 
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amplifier of electromagnetic waves. The light amplification effect 
in a laser is increased by passing the light many times through 
the same layer of the active medium. This takes place by means 
of the arrangement shown in Fig. 39.10a. Photon 4, travelling 
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parallel to the axis of the active medium (a ruby rod) /, gives 
rise to an avalanche of photons moving in the same direction 
(Fig. 39.10b). A part of this avalanche passes through semitranspar- 
ent mirror 3 and leaves the laser, and a part is reflected and 
incfeases again in the active medium (Fig. 39.410c). A part of the 
dvalariche of photons that reaches the nontransparent mirror 2 
is absorbed by it; but after reflection from this mirror the rest 
move in the same direction as the initial “starting” photon A 
(Fig. 39.10d). The multiply amplified stream of photons, leaving 
the laser through the semitransparent mirror 3, produces an ex- 
tremely intense pencil of light rays, highly directional and with 
very little angular spread. 

Photons B and C (Fig. 39.10b), travelling at an angle to the axis 
of the active medium, build up an avalanche of photons which, 
after multiple reflection, disappears from the medium and docs 
not participate in light amplification. The high coherency, sharp 
directional properties and great intensity of laser radiation are 
the basis for its ever-increasing application in science and engi- 
neering. 


PART SEVEN 
BASIC SOLID-STATE PHYSICS 


CHAPTER 40 STRUCTURE AND CERTAIN 
PROPERTIES OF SOLIDS 


40.1 Structure of Solids 


40.1.1 Solids are bodies having constant shape and volume. 
Distinction is made between crystalline and amorphous solids. 
Externally, crystals have a regular geometric shape. They have 
a periodically repeated arrangement of their constituent particles 
throughout the whole crystal. This arrangement is called the 
crystal, or space, lattice. In this sense, a crystal is said to have 
long-range order (cf.. Sect. 13.1.2). Crystals are bounded by flat 
faces, orderly arranged with respect to one another, which converge 
at the edges and vertices. Large separate crystals having the 
shape of regular polyhedrons are called monocrystals. Their shape 
is determined by their chemical composition. Polycrystals have 
a fine crystalline structure made up of a great number of small 
randomly arranged crystals (crystal grains or crystallites) that have 
grown together. 

Amorphous solids (pitch, glass, etc.) are supercooled liquids and 
have no clearly defined properties of crystals. 

In the following, discussion is confined to crystalline bodies. 
40.1.2 Each particle in a crystal lattice is subject to the forces 
of intermolecular interaction (12.1.4). An equilibrium arrange- 
ment of all the particles of a solid at the points of.the crystal lattice 
(8.1.4) corresponds to minimum free energy of the crystal (14.4.5) 
and its most stable state. The particles at the points of the crystal 
lattice are located at certain equilibrium distances from one anoth- 
er called the lattice constants. 

40.1.3 The main types of crystalline solids, distinguished from 
one another by the nature of the forces of interaction between 
their particles and the kinds of particles located at the crystal 
lattice points, are the following. 
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(a) Ionic crystals (NaCl, calcium carbonate and other salts). 
Located at the lattice points are regularly alternating positive 
and negative ions, between which are heteropolar bonds (39.4.4). 
(b) Valence (covalent) crystals (C, Ge, Te, etc.). The neutral atoms 
at the lattice points have homopolar bonds (39.4.5) between them. 
Such crystals are found in semiconductors (41.10.1) and in many 
organic solids. 

(c) Molecular crystals (At, CH,, paraffin, etc.). The points of the 
crystal lattice are occupied by molecules that are bound by Van 
der Waals forces, mainly of the dispersion type (12.1.8). 

(d) Metals (Na, Cu, Al, ctc.). At the lattice points are positive 
ions that are forined after the outer (valence) electrons (39.3.9) 
have been stripped from the atoms. These electrons form an 
electron gas (16.4.1) of collectivized free particles. 

The metallic bond is a specific type of chemical bond (39.4.1) 
that is developed between the ions in the crystal lattice and the 
electron gas. The electrons “constrict” the positive ions (mainly 
by electrostatic forces) and thereby compensate for repulsion 
between the ions. At distances between the ions equal to the 
lattice constant (40.1.2), the metallic crystal acquires a stable 
state. 

40.1.4 The anharmonic nature of the thermal vibration of the 
particles at the crystal lattice points (8.1.4) is manifested in the 
fact that the dependence of the potential energy of interaction of the 
particles on their displacement from the equilibrium position is not 
a parabolic one. The force acting on a particle is not quasi-elastic 
(40.3.5). These facts are of prime importance in understanding 
certain thermal (40.2.4) and electrical (41.5.4) properties of solids. 
40.1.5 A typical property of monocrystals (40.1.1) is their aniso- 
tropy (crystalline anisotropy), which is the dependence of the phys- 
ical properties of solids (thermal, elastic, electrical and optical 
properties) on the direction in the crystal. 


40.2 Thermal Expansion of Solids 


40.2.1 A solid is subject to thermal expansion, of cither linear or 
volume type, when its temperature is raised. Both kinds of therm- 
al expansion are characterized by the average coefficients of linear 
a, and cubic, or volume, &y expansion for the given temperature 
range. 
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40.2.2 If l is the length of a body at the temperature 0 °C, its 
elongation Al upon being heated to the temperature ¢ °C is 


Al = Ojlot, 


from which a, = Al/Ipt. 

‘The linear expansion coefficient characterizes the elongation per 
unit length A7/l as the body is heated one degree. For most solids 
a, = 10-§ to 10- K-1 and depends only slightly on the tempera- 
Lure. 

40.2.3 When a body is heated from 0°C to ¢°C, its volume 
increases from V, to V according to the law 


V = Vo (1 4+- &yt), 


from which ay = AV/Vot. The cubic expansion coefficient char- 
acterizes the volume increase per unit volume AV/V, as the body 


is heated by one degree. 
In a first approximation, the relation between the coefficients ay 


and a, is of the form 


hy xX 3a. 


40.2.4 The thermal expansion of solids is due tg the anharmonici- 
ty of the thermal vibration of the particles in the lattice (40.1.4). 
If rọ is the equilibrium distance between adjacent particles, then, 
at some arbitrary instant, the distance between\them is r = rp -+ 
l- x, where z is’ the mutual displacement of the~particles from 
their equilibrium positions due to thermal vibration. The forces 
between the particles in the lattice are not quasi-elastic, but 
depend on the displacement zx according to the relation 


Vy = —xr + bx, 


where x is the coefficient of quasi-elastic force (40.3.5), and b 
is the coefficient of anharmonicity of the vibration. The term bz? 
characterizes the deviation of the vibration from its harmonic 
type due to the different dependences of the forces of attraction 
and repulsion on the distance. 

When a solid is in an equilibrium state the positions of the crystal 
lattice points should not vary with time and the average force 
acting on each particle should equal zero: | 


apt 


582 STRUCTURE AND PROPERTIES OF SOLIDS [40 


If the vibration of the particles was strictly harmonic (Fy = —xz), 
the average displacement of the particles would be (zx) = 
= —(F,.)/x = 0, i.e. there would be no thermal expansion. For 
real anharmonic vibration, it follows from the condition that 
(Fx) = 0, that 


—x(z)-+b(x?)=0 or (x)= 2 (x3). 


For thermal vibration with small amplitudes the potential energy 
Ep of the vibrating particles is equal (28.2.3) to: 


“x2 
2 


But, according to the law of equipartition of energy (10.4.6), 
Ep = kT/2, where k is Boltzmann’s constant (8.4.5) and T is 
the absolute temperature. Thus 

x(x?) kT kT 


IESS or Gor P 


2 2 


Ep= 


Finally, (x) = bkT/x?. Hence, the average distance between 
the particles of a solid increases when it is heated, leading to 
thermal expansion. 


40.3 Brief Information on the Elastic 
Properties of Solids 


40.3.1 Deformation of a solid is the change in its dimensions and 
volume. Deformation is usually accompanied by a change in the 
shape of the body. Sometimes (in triaxial tension or compression) 
no change in shape occurs. Deformation may be caused by exter- 
nal forces acting on the body, a change in its temperature (40.2.1) 
and other reasons. 

Deformation of a body leads to displacement of its particles from 
their initial equilibrium positions at the crystal lattice points 
(8.1.1) to new positions. This displacement is opposed by the 
forces of interaction between the particles. Internal elastic forces 
are developed inside the body to counterbalance the external 
forces causing the deformation. 
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40.3.2 Deformation is said to be elastic (elastic deformation) if it 
disappears as soon as the external forces causing it cease to act. 
At this the particles of the solid return to their initial positions 
of equilibrium (40.3.1). Inelastic deformation of a solid leads to 
irreversible rearrangement of the crystal lattice (40.1.1) and the 
shape of the body is not restored when the external forces are re- 
moved. Such deformation is said to be plastic (40.3.8). The transfor- 
mation of elastic deformation into plastic deformation may occur 
upon prolonged action of even small external forces applied to the 
body. The reverse transformation cannot occur. 

The degree of elasticity is the ratio of the work that can be done by 
a deformed body when the deforming forces are slowly removed 
to the work done by external forces in deforming the body. 
40.3.3 The physical quantity equal to the elastic force dF, per 
unit area dS of the cross section of the body being deformed is 
called the stress o: . 


— dFej 
=a 


The stress is said to be normal if force dF ,; is normal to the surface 
dS and tangential, or shearing, if the force is| tangential to this 
surface. 

40.3.4 A measure of the deformation is the uniù strain, or relative 
deformation. It is equal to the ratio of the absolute deformation 
Ax to the initial value of the quantity z, which characterizes the 
shape or dimensions of a deformed body: Az/z. 

TTooke’s law states that the stress in an elastically deformed body 
is proportional to its unit strain. Thus 


o= Ky, — 
x £ ? 


where K, is the modulus of elasticity, equal in magnitude to the 
stress developed in relative deformation equal to unity. The 
quantity a, = 1/K, is called the coefficient of elasticity. Hooke’s 
law is valid only for sufficiently small relative deformation. The 
stress Op at which the deformation is no longer proportional to the 
stress is called the proportional elastic limit (point A in Fig. 40.1). 

40.3.5 In addition to elastic forces, there are forces of other origin 
than elastic forces, but which satisfy the equation a 


Py = — XT, 
P f 


584 STRUCTURE AND PROPERTIES OF SOLIDS [40 


where F is the projection of the force on the direction in which 
the absolute linear deformation zx takes place. They are called 
quasi-elastic forces. The quantity x is called the coefficient of the 
quasi-elastic force. 

40.3.6 The simplest type of deformation is longitudinal, or uni- 
axial, extension (or compression), which consists in elongating 
(shortening) a body by the action of the tensile (compressive) 
force F. Elastic extension (compression) ceases when F = Fel, 
where Fe; is the elastic force (40.3.1). The unit strain (relative 
deformation) Az/zx = Al/l, where Al is the change in length due 
to the action of force F, and l 
is the initial length of the body. 
According to Hooke’s law 

F Al 


ee 


where the modulus of elasticity 
K, = Eiscalled Young’s modu- 
lus. Young’s modulus is equal to 
the normal stress o at which the 
linear dimension of the body is 
Fig. 40.1 doubled: Al = I*. 

40.3.7 The dependence of the 

normal stress o on the unit strain 
Al/l in uniaxial extension is called the tensile stress-strain diagram 
(Fig. 40.1). Beyond the proportional elastic limit (40.3.4), an 
increase in o leads to a considerable increase in Al/l. At the stress 
Gy, corresponding to point B in the'stress-strain diagram, the unit 
strain continues to increase without increasing the stress (hori- 
zontal portion BB’ in the diagram). The stress oy is called the 
yield point. The maximum stress 0; that the body can withstand 
before rupture, point C in the diagram, is called the ultimate 
strength or, if the body is in tension, the tensile strength. Point D 
corresponds to the rupture of the body. 
40.3.8 If the load on a body, deformed to the stress og (point a 
in the diagram in Fig. 40.1) is slowly removed, the curve o = 
— @ (Al/l) is the straight line aR, parallel to the straight portion 
OA of the diagram. The line segment OR determines the permanent 
et, typical of plastic deformation. 
S 


* Assuming that Hooke’s law is valid for such large deformation. 
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40.3.9 Shear is the deformation of a body in which all its flat 
layers, parallel to a certain shear plane, are displaced parallel 
lo one another without distortion or any changes in dimensions 
(lig. 40.2). Shear results from 
the action of tangential force F, 
applied to face BC, parallel to 
(he shear plane. Face AD, paral- 
lel to face BC, is fixed. For 
small shear deformation 

CC’ 
y = tan y=, 

CD 


where CC’ = Az is the absolute 
shear, y is the shear angle, or Fig. 40.2 


unit shear strain, expressed in ra- 
dians. According to Hooke’s law 
40.3.4), the unit shear strain is proportional te the tangentiall 


(shearing) stress o, = F/S, where S is the area of fa ; i.e. 


Or == Gy. 


‘The quantity G is called the shear modulus. Its magnitude is equal 
to the tangential, or shear, stress that produces a unit shear strain 
equal to unity in the specimen. 


40.4 Basic Concepts of Phase Transitions in Solids 


40.4.1 The heating of a solid crystalline body leads to an increase 
in the amplitude of the anharmonic thermal vibration at the points 
of the crystal lattice (40.2.4) and to an increase in the average 
interpoint distances in the lattice (thermal expansion, Sect. 
40.2.1). Intense heating destroys the crystal lattice and leads 
to the transition of the substance from its solid to its liquid or 
vapour phase (first-order phase transition, Sect. 12.4.2). 

40.4.2 The transition of a substance from the solid to the gaseous 
state is called sublimation. Transition from the solid to the liquid 
state is called melting, or fusion. Melting begins at a definite 
temperature Tm for the given pressure; it is called the melting 
point, This temperature remains constant during the melting 
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process. Shown in Fig. 40.3 is the dependence T (Q), where Q is 
the quantity of heat (9.2.1) delivered to the solid being heated. 
The isothermal portion BC corresponds to a two-phase system: 
solid-liquid (cf. Sect. 12.3.2, where the two-phase system consists 
of a liquid and saturated vapour). In the melting process, the 
substance is converted from a more ordered crystalline state to a 


Solid Melting Liquid 
oes ees eee 
g Qe yo W 
ln 


Fig. 40.3 


less ordered liquid state. According to the second law of thermo- 
dynamics (11.3.2), melting is associated with an increase in the 
entropy (11.4.3) of the system. 

40.4.3 The quantity of heat rm required to melt unit mass of 
a solid at the temperature Tm is called the heat of fusion, or of 
melting. It is also called the latent heat. It follows from the first 
law of thermodynamics (9.3.1) that 


Tm = Ul — Ug + p (vi — Us); 


where u, and us are the specific internal energies (9.1.2) of the 
substance (internal energies per unit volume) in the liquid and 
solid phases, v, and vs are the specific volumes of the liquid and 
solid phases, and p is the constant pressure at which the first- 
order transition proceeds (12.4.2). 


40.4.4 The melting point Tm depends upon the pressure p. 
A change of dp in the pressure leads to a change in the melting 
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point by d7,,. The dependence Tm = Tpm (p) is expressed by the 
Clapeyron-Clausius equation (see also Sect. 13.6.5): 


dT _ Tm (vi—s) 
dp Tn : 

The notation is defined in Sect. 40.4.3. The Clapeyron-Clausius 
equation is valid for any first-order phase transition. 

In the overwhelming majority of cases the melting of a substance 
leads to an increase in its specific volume, i.e. vj > vs and 
dTm/’dp > O (rm > 0), i.e. the melting point increases with the 
pressure. Such substances can be in the solid state at high pres- 
sures in the temperature region above the critical temperature 
(12.3.4). 

‘The specific volume of certain substances (water, bismuth, 
lium) increases in melting, so that vy < vs. Ice, for instance, 
at 0°C has a lower density 
(han water. Forsuch substances 
dT,,/dp <0, i.e. the melting P C 

point decreases as the pressure 

is increased. 

40.4.5 In a pT diagram, the de- Liguid Vapour 
pendence of the temperature on 

the pressure p = f (T) in a first-or- 

der phase transition is represent- 

ed by a curve on which each point 

corresponds to equilibrium of 

two existing phases. The vapour Tle 
generation curve, for example, g 7 
shows the equilibrium of the 

two-phase system: liquid-vapour. Fig. 40.4 

Since vy — vy œ 0 (where vy is 

the specific volume of the vapour), 

it follows from the Clapeyron-Clausius equation (40.4.4), that 
dpldT > 0. The curve p = f (T), separating the regions of the 
liquid and gaseous states of a substance, ends at critical point C 
(12.3.1) (Fig. 40.4). Shown in Figs. 40.5 and 40.6 are the equilib- 
rium curves of the two-phase solid-liquid states for the cases 
when vj > vs and vj < vg, respectively. 

40.4.6 In cooling liquids to a certain temperature, called the 

freezing point Tg (or temperature of solidification), the substance is 
solidified (crystallized, or frozen). This is the transition from 
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a liquid to a solid crystalline state. In this process a quantity of 
heat is evolved that is equal to the heat of melting (40.4.3). The 
freezing point is equal to the melting point and depends upon the 
presence of impurities in the substance. Impurities lower Tẹ if 
they do not form mixed crystals with the substance. These are 
called solid solutions. Sea water, for instance, containing dissolved 


pk | 
| P Kel, 
Solid Liguid Liguid 
Solid 
0 T g F 
Fig. 40.5 Fig. 40.6 


salts, freezes at a lower temperature than distilled water. This 
serves as the basis for producing cooling mixtures. 

40.4.7 The freezing (crystallization) of liquids is asscciated with 
a change in the nature of the thermal motion of their particles. 
The time the particles remain in a temporary settled state (relaxa- 
tion time, see Sect. 13.2.4) is increased. The forces of mutual 
attraction between the particles change the thermal motion into 
random thermal vibration about the points of the crystal lattice. 
The transition of the substance to a more ordered phase is asso- 
ciated with a reduction in the entropy of the system (11.4.3). 


CHAPTER 41 AN OUTLINE OF THE QUANTUM 
PHYSICS OF SOLIDS 


41.1 Basic Concepts of Quantum Statistics 


41.1.1 Quantum statistics is a statistical method of investigation 
(8.2.2) that is applied to systems consisting of a large number of 
particles and obeying the laws of quantum mechanics (38.1.4). 
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41.1.2 In contrast to the initial propositions of classical statisti- 
cal physics (10.1.4 and 10.1.5), quantum statistics is based on the 
principle of indistinguishability of identical particles. All identical 
particles (for example, all the electrons in metals or all the pro- 
tons in atomic nuclei) are considered to be indistinguishable, in 
principle, from one another. 

41.4.3 Of prime importance in quantum statistics is the problem 
of distributing the particles with respect to their coordinates and 
velocities*. It can be stated as the problem of distributing parti- 
cles amongst the cells of “phase volume”, an element of which is 
equal to Al’ = AxAyAzAp,Ap,Ap, (10.1.5). The wave-particle 
dualism of the properties of particles of matter (38.1.3) and the 
Heisenberg indeterminacy relations (38.6.2) lead to a situation 
in which the clement of volume AT cannot be less than h?, where h 
is Planck’s constant (Appendix II). In arranging the particles 
in cells of volume kè each, in accordance with the principle of 
indistinguishability of identical particles (41.1.2), the question 
posed concerns the number of particles in the given cell, but not 
which, specifically, of the given sort of particles are in the cell. 
The state of a system of particles is not changed when identical 
particles are rearranged within the given cell of the phase space, 
or interchanged between the cells. 
41.1.4 If an element of volume AT; is divided inty cells with 
the volume kë, then the ratio AI';/h? = Ag; determines the number 
of quantum states contained in the volume AT;, having energy 
within the range from E; to E; + AE;. Particles, of a number 
equal to AN;, contained in the volume AT;, can bé distributed 
in all possible ways between Ag; states having the energy £;. 


41.2 Bose-Einstein and Fermi-Dirac Distribution 
Functions 


41.2.1 Particles with integer or zero spin (in # units) (26.1.3) 
are called bosons (for instance, photons, phonons and certain 
nuclei). Systems of such particles are described by Bose-Einstein 
quantum statistics. Bosons do not comply with the Pauli exclusion 
principle (39.3.1) and they are not subject to restrictions as to the 
number of particles that may be in a certain quantum state. 


* In classical statistics, the laws of Maxwell (10.3.1) and Boltz- 
mann (10.4.2) enable a similar problem to be solved. 


gc 
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41.2.2 The Bose-Einstein distribution function fp is the average 
population of bosons in a state with the given enetgy, i.e. the 
average number of particles in one state. Thus ) 


fe AN (Ei) 
Bea 


where AN (E;) is the number of particles having energies iti 
the range from EF; to E; + AE;, and Ag; is the number of quan- 
tum states within this energy range (41.1.4). Function fg is 
found by considering the thermodynamic probability P (41.5.2) 
of the distribution of the particles of the system amongst the 

uantum states. The most probable distribution is found under 
the condition that the total number N of particles and the total 
energy E of the system are conserved. Thus 


N, AN;=N and Y AN;E;=E. 


t t 


Summation is carried out over all the quantum states of the 
system. 


41.2.3 An application of the method of Lagrangian (undeter- 
mined) multipliers to find the conditional extremum enables the 
following expression to be obtained for the Bose-Einstein 
distribution function: 


Here k is Boltzmann’s constant (8.4.5), and T is the absolute 
temperature. The quantity u = (U — TS + pV)/N is called 
the chemical potential per particle, where U is the internal energy 
of the system (9.1.2), S is its entropy (11.4.2), V is the volume 
of the system, and p is the pressure. The chemical potential is 
the work done under isobaric-isothermic conditions in increas- 
ing the number of particles in the system by one. 

41.2.4 Particles with half-integer spin (in A units) (26.1.3) are 
called fermions (electrons, protons, neutrons, etc.). Systems of 
fermions are described by Fermi-Dirac quantum statistics. 
Fermions comply with the Pauli exclusion principle (39.3.1) 
and in a given quantum state of a system of fermions there 
can be no more than one particle. 
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41.2.5 The Fermi-Dirac distribution function fp is determined 
in a way similar to that used in Sect. 41.2.2. Thus 


The notation is defined in Sect. 41.2.2. The solution of the 
problem of the most probable distribution of the fermions 
amongst the states, under the condition of conserving total 
energy E in the system and the total number N of its particles, 
leads to the following form of the function fp: 


1 
= == T 
ace 
exp ( TT 2 aea 


where u is the chemical potential per particle (41.2.3). 
41.2.6 The distribution functions in classical and\ quantum 
statistics, introduced as the average number of particles in one 
state (44.2.2), can. be ex- 
pressed by a single equation: F 


1 

Ei—y ' 
oe ( T )+6 
For the Maxwell-Boltzmann 
distribution (cf. Sect. 10.4.2) 7 
6=0 and w= 0; for the 
Bose-Einstein distribution 
ð = —1; and for the Fermi- 0 
Dirac distribution 6 = +1. 
Plots of these three distri- 
bution functions are shown 
in Fig. 41.4. 
Example. Blackbody radiation (36.1.8) is enclosed in a cavity 
of volume V at T = const, the radiation being in a state of 
thermodynamic equilibrium with the walls of the cavity. This 
radiation can be dealt with as a photon gas complying with 
Bose-Einstein statistics, because the spin of a photon equals h 


Maxwell - Boltzmann 
Bose - Einstein 


fermi-Dirac 


j = 


Fig. 44.4 
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(Table 43.1). The number of quantum states of radiation in 
volume V (40.1.4) is 


Anp*dpV _ 8nk? dE 


dg=2 h3 hèc 


V, 


where p is the momentum of the photon (37.2.2), related to its 
energy by the equation p = E/c, where c is the velocity of light 
in free space. The factor 2 appears because there are two possible 
directions in which light can be polarized (31.1.8). The energy 
of the photon is E = hy (87.1.4), where v is the frequency. 
According to Sect. 41.2.2, the number of photons with frequen- 
cies in the range from v to v -+ dv in the volume V is 


IN = dg a 8nv? dvV 
om = “exp (Av/kT)—1 c [oxp (hv/kT)—1]° 

Taken into account here is the fact that the chemical potential u 
(41.2.3) is equal to zero for a photon gas in which the condition 
for conserving the total number of particles is not complied 
with. 

The volume density of radiant energy in the frequency range 
from v to v-+ adv is 

hvdN  8v? hv dv 


Oe ee Vo & exp (hv /kT)—1 * 


This result is Planck’s radiation formula for the volume energy 
density of blackbody radiation (36.1.5). 


41.3 Degeneracy of Systems of Particles Described 
by Quantum Statistics 


41.3.1 A system of particles (in particular, an ideal gas) is 
said to be degenerate if its properties, described by quantum 
laws, differ from those of ordinary systems that obey classical 
laws. Deviation in the behaviour of Bose and Fermi gases from 
that of classical Maxwellian-Boltzmann gas is called gas degener- 
acy. This becomes appreciable at extremely low temperatures 
and high densities (41.3.3). 
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41.3.2 The degeneracy parameter A is the quantity 
we eRT 


where p is the chemical potential (41.2.3). Under the condition 
of slight degeneracy (A < 1), the ones in the denominators of 
(he quantum distribution functions fpg and fp can be neglected, 
ind these functions become the classical Maxwellian-Boltzmann 
distribution function fy_p. Thus 


Im-p = A grr 


The degeneracy parameter is found from the condition of nor- 
malization of the distribution function: 5) AN; = N (41.2.2), 


l 
which indicates the conservation of the total number of particles 
in the system. The condition for slight degeneracy is of the form 
Noh? 


“= CamiTyR Í t 


where no is the concentration of particles, m is the 
particles, k is Boltzmann’s constant, h is Planck’s 
and T is the absolute temperature. 

41.3.3 The degeneracy temperature Tg is the temperature at 
which degeneracy becomes appreciable (41.3.1). It is determined 
from the condition 


2/3 
_ Noh? __ : une h?no 
= QnmkTF AA =1, from which Tg= coe 


‘The temperature criterion of degeneracy is as follows: when 
T < Tq the system of particles is degenerate; when T > Ta 
the system of particles is not degenerate, and its behaviour is 
described by classical laws. 

lor example, the degeneracy parameter A x 3 X 10° <1 
for hydrogen under standard conditions (T = 300 K and ny ~ 
=~ 3 X 10235 m). The degeneracy temperature for hydrogen 
is Tg ~ 1K. For all other gases, heavier than hydrogen, A 
is even smaller. Under standard conditions gases are not degen- 
erate. Degeneracy, associated with the quantum properties of 
vases, is manifested to a considerably less degree than the devia- 


38--0186 
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tions of gases from an ideal gas, which is due to intermolecular 
interaction. Photon gas is always degenerate (Tg == co) because 
the rest mass of a photon equals zero (mọ = 0, Table 43.1). 
can gas is described by Bose-Einstein quantum statistics 
(41.2.6). 

41.3.4 Electrons in metals are an example of a degenerate gas. 
Under ordinary conditions ny ~ 1028 to 1029 m-%. Since the mass 
of the electron is small (m œ 10-*° kg), Tq ~ (16 to 20) x 
x 10% K. Electron gas does not comply with the classical Max- 
well-Boltzmann statistics. 

41.3.5 The energy of degenerate electron gas (41.4.5), and 
other gases described by quantum statistics, is not a linear 
function of the temperature. Therefore, the simple physical 
interpretation of the absolute temperature (8.3.4) is not suitable 
in the region of degenerate gases (41.3.3). 


41.4 Degenerafe Electron Fermi Gas in Metals 


41.4.1 The energy distribution of conduction electrons in 
metals is described by the Fermi-Dirac distribution function 
(41.2.5): 

1 


[e= EAT] ` 


The number of quantum states of electrons per unit volume of 
metal, within the cnergy range from Æ to E 4+- dE (41.1.4) is 
d _4np?dp _ 27 (2m)3/? E1 dE 


AA E p 1B 


Use is made here of the relation between the momentum p of 
the clectron and its energy: 


1/2 
p2=2mE, ap =(s] Í dE. 
The number of electrons dno (E) in unit volume of the metal, 
having energies within the range from E to E + dF, equals, 
according to Sect. 41.2.2: 
4n (2m)3/2 E12 dE 


dno (E) = 2fp dg (P) = ap E OTT 
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Factor 2 takes into account the fact that the electrons comply 
with the Pauli exclusion principle (39.3.1). This equation 
sxpresses the energy distribution law for conduction electrons 
in a metal. 

41.4.2 The unusual properties of electron gas have an effect 
on its behaviour at the absolute zero temperature (T == 0 K). 
- uo is the chemical potential of the gas at T = 0 K (41.2.3), 
then 


4 
[e = 


AE voT 4’ 


at E < uo the difference E — uo < 0 and 


1 
{y= — ~: 


pHo- E)/RT Ti 


at E — puo <0 and T —0, exp [(uo — E)/kT] > œ and 

fp — 1; at E > uo, the difference E — uo > 0 and at T — 0, 

exp [(E — uo) /kT] > œ and fp — 0. 

The curve of the Fermi distribution function fp at T =0K 

is given in Fig. 41.2. In the energy range from 0 to po, function 

fp is equal to unity. In pas- £ 

sing through the value E = uo, Fo 

the function drops abruptly to : 

zero. At T= 0 K, the electrons / 

of the metal occupy all allowed 

energy levels with a population 

equal to unity (41.2.2) up to the 

level with the energy E = uo- 

All levels with snetey exceeding g ko E 
9 are empty; their population s 

s zero. Thus, po is the max- Fig. 41.2 

imum energy that conduction 

electrons can have at absolute zero temperature. This quantity 

is called the Fermi energy: up = Ey. If the consecutive numbers 

of the energy levels are plotted along the axis of abscissas in 

lig. 44.2, and the corresponding numbers of conduction electrons 

per unit volume are the ordinates, the maximum ordinate 

corresponds to the energy level number n)/2, where ny is the 

electron concentration. The highest level occupied by clectrons 


38% 
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is called the Fermi level. It corresponds to the Fermi energy Ep, 
which an electron has at this level. 

41.4.3 The energy distribution law for conduction electrons at 
T = 0K is of the form 


An (2m)3/2E1/2 dE 
dno (m= Seno 


It is represented graphically by the curve in Fig. 41.3. 
The total number of conduction electrons in unit volume of 
a metal is 


Z 3/2 4n (2m)3/2 
i \ dng (E) = FE \ Ezar = TEW È py? 
0 o 
Hence 


Ep 


=. Bno j 
~ 2m ( Sit 


The average energy (E) of the electron at T = 0 K is 
3 3 h? / 3ng \2/3 
G e a (Se) 


At no = 6 X 1028 m- we obtain, after substituting the values 
of h and m: 


(E) = 9 X 10- J = 5.4 eV. 


If we compare this energy with the average energy of a molecule 
of nondegenerate monatomic gas (E) = 3kT/2 (40.2.5), we 
find the energy Fy that the molecule could have at the tempera- 
ture T x 104 K. In other words, under ordinary conditions, 
kKT/Ep < 1. 

41.4.4 At a temperature different from zero the Fermi-Dirac 
distribution function fp is of the form shown in Fig. 41.4 by 
the full-line curve. The same figure shows the function fp at 
T = 0 K in dash lines. The curves differ in the kind of drop 
the function undergoes near the value E = p. An abrupt drop 
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along a vertical line at E = uo = Ep for the case when T = 
-= 0 K (Fig. 41.2) changes to the smooth curve ABC at T Æ 
0 K. Of importance is the fact that the states of the electrons, 
at the energy levels that satisfy the condition E < u, do not 
change when heated from 0 K to T. Distortion of the function 


dn 
dE J 
f Pn at 
7 g B 
iG 
Fig. 41.3 Fig. 41.4 


fp occurs only at its “tail-end”, in an energy range of the width 
2kT, near the value E = yp, at which fp = 1/2. 

41.4.5 The chemical potential u (41.2.3) of an electron gas at 
the temperature T is 


a i T? f kT \2- 
p= Er [1-35 (ze) |. 


The internal energy U (9.1.2) of a mole of electron gas is 
_ 3 | 5 m2 (kT)? ` 
Ca aes [1+ 12 | 


Here Na is Avogadro’s number (Appendix II), and k is Boltz- 
mann’s constant (8.4.5). Quantities u and U remain practically 
constant when the temperature is raised. 

The molar heat capacity Cy, (9.5.2) of an electron gas is 


ðU m j kT 


U a as 
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A comparison with the heat capacity of a nondegencrate mon- 
atomic gas (10.7.1) Co =: 3N,k/2, indicates that 


Cyu _ T kT 
Class vm 
Cya 3 Ep 
At room temperature k7/Fy x% 0.01 (41.4.3), and therefore 


Cyu CYS ~ 0.03. The heat capacity of degenerate electron 


gas is negligibly small. This is so because only a slight number 
of electrons, within the “drop region” of the Fermi-Dirac dis- 
tribution function fp (hatched regions 7 and 2 in Fig. 41.4), 
participate in the process of changing the internal energy of 
the electron gas when it is heated. This removes one of the chie! 
shortcomings of the classical electron theory of the electrical 
conduction of metals (20.3.7). 


41.5 Quantum Theory of Electrical Conduction 
in Metals 


41.5.1 The theory of the electrical conduction of metals, based 
on quantum mechanics (38.1.4) and Fermi-Dirac quantum sta- 
tistics (41.2.5) is called the quantum theory of electrical conduc- 
tion in metals. Using the Fermi-Dirac distribution function 
(41.2.5) in this theory, Ohm's law for current density (20.3.4) 
was derived. Thus 


j = xE. 


41.5.2 In the quantum theory of electrical conduction, the, 
electrical conductivity x is calculated by the formula 
noe? (ÀF) 


MUJ 


where ny is the number of conduction electrons per unit volume 
of the metal, (Ay) is the mean free path of the electron (20.3.1) 
having Fermi energy (41.4.2), and up is the velocity of thermal 
motion of such an electron. Notwithstanding the external re- 
semblance between this formula and the formula for x in the 
classical electron theory (20.3.4), it has an entirely different 
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physical content and, in contrast to the classical formula, it 
completely agrees with the experimental data. 

41.5.3 Also explained in the quantum theory of electrical 
conduction of metals is the dependence of electrical conductivity 
on temperature (x X 1/T), as well as the anomalously large 
mean free path of the electron in a metal (20.3.7). The ordered 
motions of electrons in a metal—electric current—is dealt with 
in quantum theory as the process of propagation of electron 
de Broglie waves (38.1.4) scattered by anharmonic thermal 
vibrations of the lattice points in the metal (40.2.4). An an 
vous phenomenon occurs in optics in the propagation of a/i 
wave through a turbid medium, containing scattering gentres 
(suspended particles in a liquid, colloidal solutions,’ etc.) 
(34.3.1). When the distance between the scattering centres are 
of an order comparable to the wavelength, scattering of the 
light occurs and the intensity of the light decreases as it propa- 
vates in the medium. At distances between the scattering centres 
substantially less than the wavelength, the medium is optically 
homogeneous and no light scattering occurs. 

41.5.4 An ideal crystal lattice, free of all violations of periodi- 
city and with stationary particles at the lattice points, behaves 
like an optically hgmogeneous medium; it does not scatter 
electron waves and conduction electrons pass through such 
a lattice without resistance. The scattering of electron waves 
occurs only when the periodicity of the lattice is violated. 
‘hese are inhomogeneities that act as scattering centres. Such 
centres are, for example, density fluctuations in the lattice 
(11.6.3), resulting from anharmonic thermal vibrations of the 
positive ions of the metal. With the exception of superconductors 
(41.6.1), this scattering is what leads to the existence of electri- 
cal resistance in pure metals. 

41.5.5 The scattering of the electron waves by the thermal 
vibrations of the lattice points increases with the temperature, 
whereas the mean free path of the electrons is reduced. 

‘The quantity (Ap) is calculated by the formula 


Ed 
(Ap) = Mn kT ’ 


where Æ is Young’s modulus (40.3.6), d is the crystal lattice 
constant, no is the number of atoms in unit volume, k is Boltz- 
mann’s constant (8.4.5), and T is the absolute temperature. 


600 QUANTUM PHYSICS OF SOLIDS [4 


Here the electrical conductivity is found to be inversely pro- 
portional to the absolute temperature and independent of ny 
for monovalent metals*. Thus 


_ Ed 
— mupnkT': 


This formula agrees with experimental data in the region of 
room temperatures. The conductivity, for instance, obtained 
for silver (E = 10? N/m?, d= 3 X 10" m, m=9 xX 
x 10-2 kg, kT ~ 4.2 X 10-2 J, and up = 1.4 m/s) is 
%theor S 5 X 107 1/Q-m, whereas Xxexper ~ 6.3 X 107 1/Q-m. 


F 
Residual 
resistivity 
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Fig. 41.5 Fig. 41.6 


At very low temperatures the formulas given above are not val- 
id. The mean free path under these conditions is inversely pro- 
portional to 75, which agrees with experimental data. 
41.5.6 Besides density fluctuations, the cause of electron wave 
scattering and the electrical resistance of metals is violation 
in the periodicity of the crystal lattice due to the inclusions of 
impurity atoms in the lattice. The resistivity p of a metal 
(0 = 1/x) consists. of two parts: | 


£ = Prt Pim, 


where 07 is the resistivity due to the scattering of electron waves 
by density fluctuations, and pim is the resistivity due to scat- 
tering by impurities. Of importance is the fact that pj, is 
Independent of the temperature. Since as T > 0, py —> 0 as 


* The concentration of conduction electrons and ions in the 
lattice coincide in such metals. 
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well, @ > Pim as the temperature is lowered. The quantity 
Pim is called the residual resistivity. This resistivity remains a 
property of the metal when it is cooled even down to absolute 
zero. The dependence of the electrical resistivity on the tem- 
perature is shown in Fig. 41.5. The point of intersection of the 
extension of the curve (dash line) with the axis of ordinates 
(to T = 0 K) is the residual resistivity. 


41.6 Superconductivity 


41.6.1 The phenomenon of superconductivity consists \in the 
practically complete disappearance of electrical resistivity 
in certain metals and alloys at a certain temperature Te, called 
the transition temperature to a superconductive state. Substances 
having this property are called superconductors. Today over 
500 pure elements and alloys are known that display supercon- 
ductivity. The temperature range AB (Fig. 41.6) of the transi- 
tion to a superconductive state does not exceed thousandths of 
a degree for pure specimens. It therefore proves expedient to 
assign a definite value to Te. The width of the range AB de- 
pends on nonhomogeneities in the metal, primarily on the pres- 
ence of impurities and internal stresses in the specimen. The 
temperatures 7, known at the present time range from 0.155 K 
(BiPt) to 23.2 K (Nb,Ge). 

The isotope effect in superconductors is the fact that the transi- 
tion temperatures Te of various isotopes (42.1.3) of the same 
superconducting mctal are inversely proportional to the square 
roots of their atomic masses. 

41.6.2 A sufficiently strong magnetic field at a given tempera- 
ture destroys the superconductive state of a substance. When 
a magnetic field is applied to a superconductor, the transition 
temperature Te (41.6.1) is lowered. 

A magnetic field of the strength H, which, at the given tempera- 
ture, causes the transition of a substance from the superconduc- 
tive to the normal state is called the critical field. As the tempe- 
rature of the superconductor is lowered, the field strength H, 
increases. As a first approximation: Me = IM, (4 — (7/T,)?]. 
The curve in Fig. 41.7 separates the superconductive and normal 
states of a substance. 

The superconductive properties of a conductor disappear when 
a sufficiently heavy current is passed through them. This is 
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due to the action of the magnetic field, set up by the current, 
which destroys the superconductive state. 

41.6.3 An external magnetic field, weaker than the critical 
field (41.6.2) does not penetrate into the interior of a super- 
conductor. Hence, the magnetic induction B (23.1.2) is always 
zero in the bulk of the superconductor. Shown in Vig. 41.8 is 


Fig. 41.7 Fig. 41.8 


a uniform magnetic field (23.1.3) directed along the axis of 
a cylindrical superconductor. The superconductor seems to 
“push” the magnetic field out of the space it itself occupies and 
is an ideal diamagnetic material with a magnetic susceptibility 
of Xm = —1 (26.4.5). Here the relative magnetic permeability 
r= 14 Xm = 0 and B = uou H = 0. 
41.6.4 The transition of a substance to the superconductive 
state is accompanied by changes in its thermal properties. 
In the absence of a magnetic field and at the transition temper- 
ature Te (41.6.1), there is an abrupt change in the heat capacity 
(9.5.1). In the presence of a magnetic field an isothermal transi- 
tion from the superconductive to the normal state is associated 
with a jump in the heat conductivity and heat capacity. These 
phenomena are typical features of a second-order phase transi- 
tion (12.4.2). | 
41.6.5 In quantum-mechanical theory, the phenomenon of 
superconductivily is considered to be the superfluidity (12.4.3) 
of electrons in a metal with the absence of friction inherent 
in superfluidity. The conduction clectrons move along a super- 
conductor unimpeded, i.e. without any “friction” between them 
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and the crystal lattice points. The main feature of superconduc- 
(ors is that the mutual attraction of the electrons in them leads 
to the pairing of electrons. The cause of this attraction is inter- 
action between electrons, supplementary to Coulomb repulsion, 
accomplished by the action of the crystal lattice and leading 
lo the pairing. A model of this attraction is illustrated in 
iig. 414.9. Conduction electron 
e, attracts ion J of the crystal 
lattice to itself, displacing the 
ion from its equilibrium posi- 
lion. This changes the electric 
licld in the crystal; ion J sets 
up an electric field that acts on 
the conduction electrons, includ- 
ing electron e». Thus the inter- 
action between electrons eand 
ry is accomplished by means of 
the crystal lattice. The displace- 
ment of the ion by an electron 
leads to a situation in which 
the electron is surrounded by a “cloud” of positive charge, which 
exceeds the intrinsig negative charge of the electron. Together 
with this “cloud” the electron has a net positive charge and is 
thereby attracted to another electron. 

41.6.6 In the quantum theory of metals attraction between 
electrons (phonon exchange) is associated with the setting up 
of elementary excitations (quasi-particles) of the crystal lattice. 
An electron, moving in a crystal and interacting with another 
clectron by means of the lattice, brings the lattice into an excit- 
ed state. As the lattice returns to the ground state it emits a 
quantum of energy of acoustic frequency, called a phonon 
(41.7.5), which is absorbed by the other electron. The attraction 
between electrons can be conceived of as their exchange of 
phonons. This attraction is found to be most effective if the 
momenta of the interacting electrons are anti-parallel. 
41.6.7 The initiation of a superconductive state in a substance 
is associated with the possibility of forming bound pairs of 
electrons in the metal. The distance 6 between the electrons of 
a pair equals 


Fig. 41.9 


hup 
kTe ’ 
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where h is Planck’s constant, up is the velocity of an electron 
at the Fermi level (41.5.2), k is Boltzmann’s constant (8.4.5), 
and 7, is the transition temperature to a superconductive state 
(41.6.1). An estimation indicates that 6 ~ 10-* cm, i.e. the 
electrons forming a pair are at a distance from each other of the 
order of 104 crystal lattice constants. The whole electronic 
system of a superconductor is a bound aggregate extending 
over huge distances on the atomic scale. 

If at arbitrarily low temperatures Coulomb repulsion between 
electrons predominates over the attraction that forms pairs, 
the substance (metal or alloy) remains normal with respect 
to its electrical properties. But if the forces of attraction pre- 
dominate over those of repulsion at the temperature Te (41.6.1), 
the substance goes over to the superconductive state. 

41.6.8 The most important feature associated with the pair 
in the aggregate of electrons in a superconductor is the impossi- 
bility of energy exchange between the electrons and the lattice 
in small portions, smaller than the binding energy of paired 
electrons. This means that in collisions between the electrons 
and the crystal lattice points, the energy of the electrons does 
not change, and the substance behaves as a superconductor with 
zero resistivity. A quantum-mechanical analysis shows that 
there is no scattering of electron waves by the thermal vibra- 
tions of the lattice or impurities. This is what indicates the 
absence of electrical resistance. 

41.6.9 A definite amount of energy is required to destroy the 
state of superconductivity. At T >> Te (41.6.1), the bound 
states of the paired electrons are violated, attraction between 
the electrons ceases and superconductivity no longer exists. 
41.6.10 Superconductivity is being used to produce strong 
magnetic fields because there are no thermal losses when heavy 
currents, required to set up strong magnetic fields, are passed 
through superconductors. But, since a magnetic field destroys 
the state of superconductivity (41.6.2), special type-2 super- 
conductors have been developed for obtaining strong magnetic 
fields. These are certain alloys and superconducting thin films. 
In such superconductors magnetic fields with a field strength 
greater than Me (44.6.2) penetrate the substance in the form of 
threads piercing the specimen. The substance between the 
threads is found to be superconductive, and heavy currents can 
set up ultrastrong magnetic fields. Magnets, based on the appli- 
cation of superconducting solenoids, are being extensively used. 
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41.7 Heat Capacity of Solids 


41.7.1 No distinction is made for solids between the heat capaci- 
lies Cy and Cy (9.5.4 and 9.5.7). In nonmetallic solids, the larg- 


est contribution to heat capacity is made by the energy of 
thermal vibration of the particles at the crystal lattice points. 
l'or metals, a slight contribution is made by the degenerat 
electron gas (41.4.5). 

41.7.2 The classical theory of the heat capacity of solids fis 
based on the principle of equipartition of energy (10.6.4). 
A homogeneous solid is dealt with as a system of particl 

independent of one another, having three degrees of freedom an 
executing thermal vibrations with the same frequency. The 
average energy (EY per degree of freedom is (EY = kT (10.6.5). 
‘The internal energy U (9.1.2) per mole of a solid is 


U = 3N,(E) = 38NakT = 3RT, 


where Va is Avogadro’s number (Appendix II), k is Boltzmann’s 
constant (8.4.5), and R = kN, is the universal gas constant 
(8.4.4). The molar heat capacity* (9.5.4) of a solid with an atom- 
ic crystal lattice is 


; ôU J cal 
Mea ot a 7 TOL 


The Dulong-Petit law states that the molar heat capacity of all 
chemically simple crystalline solids equals approximately 
6 cal/mol-K. According to this law tha molar heat capacity 
does not depend on either the temperature or any characteristics 
of the crystals. Experiments disprove this statement and indicate 
that the heat capacity depends on the temperature, especially 
in the region of low temperatures (Fig. 41.10). The reason why 
the classical theory of the heat capacity of solids disagrees with 
experimental data is in the limited nature of the principle of 
equipartition of energy and its inadequacy in the region of low 
temperatures. At these temperatures the average energy of the 


* Frequently, the heat capacity is referred to one gram-atom of 
the substance. 
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vibrating particles must be calculated in accordance with 
the laws of quantum mechanics (38.5.9). 

41.7.3 In the initial quantum theory of heat capacity of solids, 
a crystal was treated as a system of N atoms, cach of which is 
a quantum harmonic oscillator. All the atoms vibrate indepen- 
dently of one another with the same frequency œ. The average 


Fig. 41.10 


energy (/) per degree of freedom of the atom (as a quantum 
harmonie oscillator) is equal (38.5.9) to 


(E) = hoy = hy 
i exp (fw/kT)—1 exp (hv/kT)—1 ° 


The internal energy U (9.1.2) of a mole of a solid is expressed 
as follows: 

hv 
U=3Na Ele NA een eli) j 


from which the molar heat capacity of a solid can be found 


_ hv \2 exp (hv/kT) 
Cis on ( kT ) [exp (hv/kT)—-1}? ° 
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If we introduce the characteristic temperature Tg = hv/kT, then 
To 


T/T 
2 e 9 
Ca =3R (2) ne RREN 
T J (e of — 4 )2 
This result qualitatively describes the dependence of the heat 
capacity of solids on the temperature. Curves of this dependence 
are plotted in Fig. 41.10. 
At high temperatures (hv < kT), as shown in Sect. 38.5.9, 
(E) = kT in accordance with the law of equipartition of energy 
(10.6.4) and C, = 3R. 
At low temperatures (hv > kT) 


a VIRT Aae VRT and Cu =3R (y gor gis. 


As T +0, T/T — œ and exp (—To/T) > 0 more rapidly 
than the increase in (79/7)?. Therefore, as T — 0, the heat 
capacity Cy > 0. This qualitatively agrees with experimental 
data. But simple quantum theory does not explain the quanti- 
tative behaviour of solids in the vicinity of absolute zero. 
44.7.4 The assumption that all the atoms of a solid execute 
thermai vibrations independently of one anotherat the same 
frequency oversimplifies the true picture of particle vibration 
in a crystal lattice. The interaction between the atoms (or other 
particles) of a solid is so strong that all N particles of the body 
form a bound system having 3N degrees of freedom, and the 
vibration of all the atoms may proceed at various frequencies. 
The extremely complex problem of the frequency distribution 
of atomic vibration in a solid was, in its time, the basis for the 
refined theory of heat capacity of solids. A solid has a wide 
spectrum of vibrational frequencies. There are vibrations with 
sufficiently low and with higher frequencies. The low frequencies 
correspond to elastic vibration of the crystal in the acoustic (or 
ultrasonic) range. Owing to the bonds between the particles of 
a crystal lattice, elastic acoustic waves propagate in the crystal. 
The physical idea of the theory of the heat capacity of solids, 
refining the theory discussed in Sect. 41.7.3, is that the main 
contribution to the energy of thermal vibration of the crystal 
is made by low-frequency vibration, corresponding to elastic 
waves with wavelengths, exceeding the crystal lattice constant. 
This follows, in particular, from Fig. 38.7, which shows that 
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the maximum contribution to the average energy of a quantum 
oscillator is made by vibrations with low frequencies (38.5.9). 
The propagating elastic waves behave as if they were propagat- 
ing in a continuous medium. The atomic structure of the crystal 
has no effect on the propagation in it of elastic waves of wave- 
lengths exceeding Amin = v/Vmax, Where v is the velocity of the 
corresponding elastic wave, and vmax is its frequency. The 
wavelength Amin should be commensurable with the lattice 
constant, i.e. Amin © d% (V/N)1/3, where N is the number of 
particles (crystal lattice points) in a crystal of volume V. 

41.7.5 Elastic waves in a crystal have quantum properties 
consisting in the existence of a minimum portion of energy, 
a quantum of energy of the wave with the given frequency v. 
This enables a wave with the frequency v to be correlated to 
a quasi-particle, the phonon, whose propagation at the velocity 
of sound v corresponds to an acoustic wave. 

A phonon has the energy E = hv, where h is Planck’s constant: 
(Appendix II), and the quasi-momentum p = hv/v. The quasi- 
momentum p of the phonon is in the direction coinciding with 
that of the propagation of the acoustic wave. The most essential 
difference between the quasi-momentum and the momentum 
is that in the collision of phononsincrystals the quasi-momentum 
can be transferred in discrete portions to the crystal lattice. 
In this case it is not conserved*. 

In short, the quantization of the acoustic field leads to phonons 
in the same way as the quantization of the electromagnetic field 
leads to photons (37.1.4). 

41.7.6 Phonon spin equals zero and, therefore, phonons comply 
with Bose-Einstein statistics (44.2.2). Phonons can be emitted 
and they can be absorbed, but their number is not maintained 
constant, hence the chemical potential for phonon gas (41.2.3) 
equals zero (cf. Sect. 44.2.6). 

The energy of a crystal can be dealt with as the energy of a pho- 
non gas and can be calculated as in Sect. 41.2.6. The number dN 
of phonons with frequencies in the range from v to v -+ dv is 


dg 

aN = ——_-—__ , 
ghv/hT __4 

* A detailed discussion of the properties of quasi-particles, 

associated with the role of periodicity of crystal structure, is 

beyond the scope of this handbook. 
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where dg is the number of quantum states (number of cells) 
in volume V of the crystal: 
4np? dpV _, 41v? dv 


h3 7 v3 V, 


dge =3 
where p = hv/v is the quasi-momentum of the phonon, and v 
is the velocity of sound in the crystal. The factor 3 takes into 
account the fact that longitudinal and transverse waves, with 
two mutually perpendicular polarizations, can propagate in 
a crystal* (31.1.8). Thus 


127v? dvV 


P a NET dy 


The internal energy U (9.1.2) of a crystal (accurate to zero- 
point energy) is 


vV 


max par Vmax T 
JUV ft V? av 
gm \ wanes | oem 
0 0 


where Vmax = v (3N/4nV)1/ is the upper frequenc 
of the phonons that contribute to the energy of 
tion of the crystal. An assessment of the order/of magnitude of 
Vmax is given in Sect. 41.7.4. 

Introduced in calculating U is the Debye characteristic tempera- 
ture Tp = hvmax/k. The following two limiting cases are con- 
sidered. 

(a) High temperatures T > Tp. 


max 


LT \ y2 dy — VRT 
0 


hv/hT __4 A hv {20V 
ps 


ITere e IT z3 


x 


3 


Vv 
x aa —=3NkKT. 


* The difference in the velocities of the longitudinal and trans- 
verse waves is ignored, the number of quantum states of the 
phonons being simply tripled. 
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For one mole of the crystal N = Na, where Na is Avogadro's 
number (Appendix II), and the molar heat capacity C,, complies 
with the Dulong-Petit law (44.7.2): 


ôU 
(b) Low temperatures T < Tp. 
Vmax 
In calculatin the integral | aea 2 eee a ne 
8 8 exp (hv/kT)—1 


variable, =hv/kT, is introduced and the upper limit is 
replaced by oo. Thus 


Vmax co 
U= 142nVh \ v3 dv _ 120Vh G )’ C EdE 
~ p exp (hv/kT)—1 v? h \ eb —4 
0 
Ank _, 
=- 


The molar heat capacity is proportional to the cube of the abso- 
lute temperature and obeys the Debye TÌ law. Thus 


ðU 16n®k4VT3 


= I ———— 3, 
Cy aT Bh a3 const T 


41.8 Band Theory of Solids 


41.8.1 Long-range order in crystals (40.1.1) leads to a situation 
in which an electric field, which is a periodic function of the 
coordinates, exists in solids. In a metal, for example, where the 
positive ions are located in strict order at the lattice points, 
the potential energy of an electron varies along some direction 
OX as shown in Fig. 41.14. Energy minima correspond to the 
places where the positive ions are located. 

41.8.2 A periodic electric field in a crystal of any type (40.1.3) 
essentially changes the energy states of the electrons in a solid 
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is compared to their state in single atoms. In single atoms the 
electrons are in discrete energy states (38.2.5). In a solid, the 
onergy states of the electrons are determined both by their 
interaction with the nucleus of their atom and by the electric 
cld of the crystal lattice, i.e. by their interaction with other 
atoms. As a result of this interaction, the energy levels of the 


Nereis’: 


Fig.t41.444 


electrons are split. Thus, in a solid containing N interacting 
atoms, N closely spaced energy levels are formed in place of 
the discrete energy level typical of a single atom. These N 
energy levels‘make up an energy band. A band energy spectrum 
of the electrons is formed in crystals. 

41.8.3 The, formation of a band energy spectrum in 
follows from the indeterminacy principle (38.6.6). Owing to the 
linite lifetime t of the electron in an excited state (t ~ 10-8 s) 
(39.1.5) the width AZ of the energy level in a single atom is 


crystal 


(natural energy level width). 

‘I'he valence electrons (39.3.9) of the atoms in a crystal are more 
weakly bound to the nuclei than the inner-shell electrons and 
can pass from one atom to another by means of the tunnel effect 
(38.7.2). In this way electrons penetrate the potential barrier 
(38.7.1) separating the atoms of the crystal. 

To assess the average lifetime t of a valence electron in a given 
alom it is assumed that the penetrability D of the barrier is 
expressed by the simple formula (38.7.2) 


-2 V 2m(Uo— E) L 
Dre i 


JUS 
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where Up — E is the barrier height, consisting of the ioniza- 
tion energy, and amounts to approximately 10 eV, and Z is 
the barrier width, which is commensurate with the crystal 
lattice constant: L ~ d ~ 10-1 m. The frequency v of penetra- 
tion of an electron through the barrier is 


‘5 j -2 VimU E) L 
vacay 

d d : 
where v is the velocity of the electron in the atom (potential 
well) and can be taken equal to v-~ 10° m/s. The average life- 
time t of an electron in the given atom is 


2 ee ee ee 
d a V2m(Uo- E) L 
v 


1 
T= >X 
y 


After substituting the values of all the quantities, t ~ 10-715 s 
is obtained. From the indeterminacy relation 


EEE eV. 
T 


Instead of the natural width AE ~ 10 eV of the electron 
energy level in a single atom, a band of allowed values of energy 
is formed in a crystal. 

41.8.4 The frequency of the penetration of inner-shell electrons 
in atoms through a potential barrier and their transitions to 
other atoms is negligible. This is due to the increase in barrier 
height: U — E ~ 10% eV and the increase in barrier width: 
L %3 x 10- m. According to calculations, t ~ 10? years. 
The broadening of the energy levels of the inner-shell electrons 
is unessential, and these electrons behave in crystals practically 
in the same way as in single atoms. 

41.8.5 If N is the total number of atoms in a solid, then the 
energy band, formed from the electronic energy level of the 
valence electron of the atom consists of N closely spaced levels. 
Adjacent energy levels in a band are at about 10-22 eV from each 
other. 

The allowed energy bands are separated by regions that are 
forbidden bands of electron energy values. The width of the for- 
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hidden bands is commensurable with that of the allowed bands. 
With an increase in energy the width of the allowed energy 
bands increases and that of the forbidden bands decreases. 
A schematic diagram of the energy bands of a solid is shown 
in Fig. 41.42. 

41.8.6 The allowed energy bands in a solid can be filled in 
different ways by electrons. In limiting cases they can be com- 
pletely filled or entirely empty (41.9.1). The electrons in a solid 
can move from one allowed band to another. For an electron 


£ 
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R 

Bands of forbidde 

energy value. 

Fig. 41.12 


lo move from a lower band to the next higher one it is necessary 
to expend energy equal to the width of the forbidden band be- 
tween the two allowed ones (of the order of several eV). 

Very little energy is required for intraband transitions. For 
example, an energy of 10-4 to 10-8 eV is sufficient for this pur- 
pose, and it is acquired by an electron in a metal by the action 
of an electric field over a free path length at ordinary potential 
differences. But this energy is insufficient to move an electron 
from one band to another. The action of thermal excitation can 
impart various energies to electrons, sufficient both for intra- 
band and interband transitions. 
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41.9 Metals and Dielectrics in the Band Theory 


41.9.1 Differences in the electric properties of solids are ex- 
plained by the band theory (41.8.2) by the fact that the allowed 
energy bands (41.8.2) are differently filled by electrons and by 
the different widths of the forbidden bands (41.8.5). These two 
factors determine whether a given solid is a conductor of electric 
current or is a dielectric. A necessary condition for a solid to be 
a conductor is the availability of empty energy levels to which 
the electric field can raise the electrons. It should be taken into 
consideration that the field can bring about only intraband 
transitionsyof the electrons (41.8.6). 

41.9.2 Typical examples of conductors are the metals of the 
first group of Mendeleev’s periodic table (39.3.5), for instance, 
sodium. In a single atom of sodium there are two filled electron 
shells, containing 2 and 8 electrons, respectively (39.3.4). 
According to the Pauli principle (39.3.1), the eleventh valence 
electron of the sodium atom only fills half of the upper energy 
level. In a crystal of sodium, the first two filled shells of single 
atoms correspond to the energy bands completely filled with 
electrons. 

The valence electrons of the sodium atoms in a crystal half 
fill the levels of the band of allowed energy values (horizontally 
hatched in Fig. 44.13). This is called the conduction band because 
the electrons in it participate in producing the conduction cur- 
rent (20.1.2). By the action of the electric field, set up in the 


forbidden band | 


Empty energy levels 


A Filled i levels _ 
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crystal by the source of electrical energy, the valence electrons 
acquire more energy and move to higher, unoccupied energy 
levels in the conduction band. This initiates their ordered motion 
and produces a current in the crystal. a | 
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Thus, if the band is not completely occupied by valence elec- 
trons, a solid is always a conductor of electric current. 
41.9.3 Hybridization of allowed energy bands may occur in 
crystals. A band formed in splitting an upper excited level may 
overlap a band produced by the splitting of a lower state of 
valence electrons. This phenomenon is observed in the crystals 
of elements of the second group in Mendeleev’s periodic table 
(Be, Cd, Mg, Zn and others). This forms a wider hybrid band 
in which valence electrons are located, filling it only partly. 
Hence, a hybrid band is a conduction band, and such crystals are 
conductors of electric current. 

41.9.4 The energy bands do not overlap in solid dielectrics, 
and the band that combines the energy levels of the valence 
electrons of atoms or ions is completely filled with electrons, 
whereas all higher bands are entirely empty at T = 0 K. The 
band completely filled with electrons is called the valence band. 
The empty bands are conduction bands. An example of a crystal- 
line dielectric is common salt eae): The NaCl molecule has 
a heteropolar bond (39.4.4), which leads to the formation of 
Na+ and Cl- ions with completely filled electron shells. The 
NaCl crystal has a valence band of the Cl- ion and all lower 
bands are completely filled, whereas the upper Nat ion band, 
located above, is entirely empty. The Cl- and Nat bands are 
separated by 6 eV, and the source of electrical energy cannot 
excite the electrons from the completely filled Cl- band to the 
empty Nat conduction band. This explaj e dielectric prop- 
erties of NaCl. 


41.10 Electrical Conduction of Semiconductors 


41.10.1 Semiconductors include a large class of substances 
whose resistivity varies in a wide range from 10- to 108 Q-m, 
and is reduced very rapidly, according to an exponential law, 
as the temperature is raised (41.10.2). The most typical and 
extensively employed semiconductor materials are the chemical 
clements germanium, silicon and tellurium. In the outer shell 
of germanium and silicon atoms there are four valence electrons, 
which form covalent bonds (39.4.5) with the valence electrons 
of adjacent atoms. There are no “free” valence electrons in 
chemically pure crystals of these semiconductors. On the basis 
of the band theory (41.8.2) crystalline semiconductors belong 
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to the type of solids in which the valence band (41.9.4) is sepa- 
rated from the empty conduction band (at 7 = 0 K) by a com- 
paratively narrow energy region jAZ, (Fig. 41.14), less than that 
in dielectric crystals (41.9.4). 

Hor silicon AE, = 1.1 eV; 

. a or germanium AE =0.72 eV. 
Conduction bend 41.10.2 The electrical con- 
duction of chemically pure 

semiconductors is called the 

Af, intrinsic conduction. Electron 
conduction (n-type conduction) 
Valence band occurs when electrons are 
transferred from the va- 
lence band to the conduction 
band. This requires an expen- 
diture of energy, not less than 
the width AE, of the for- 
bidden band (Fig. 41.14). The quantity AE, is called the activa- 
tion energy of intrinsic conduction. As the temperature of the 
semiconductor is raised, an increased number of electrons move 
from the valence band to the conduction band and participate 
in electrical conduction. The conductivity (20.3.4) of semicon- 
ductors increases with the temperature T according to the law 


—~AEo/2kT 
? 


Fig. 41.14 


where k is Boltzmann’s constant (9.4.5). The resistivity (20.3.4) 
of semiconductors drastically decreases as the temperature is 
raised according to the relation 


p = pp eA Eo/2hT, 


This feature of semiconductors essentially differentiates them 
from metals (20.3.7). | 
41.10.3. The transition of an electron from the valence band 
of a semiconductor to the conduction band indicates that the 
covalent bonds (39.4.5) of the atoms are violated in the semi- 
conductor crystal. A certain one of the valence electrons of one 
of the atoms in the lattice leaves its place. The abandoned place, 
or vacancy, displays surplus positive charge. Such a vacancy 
is called a positive hole, and is treated as a positive charge 
carrier. In the band theory this means that a vacant energy 
level appears in the valence band of the crystal. A positive 
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hole behaves like a positive charge, equal in magnitude to the 
charge of the electron. Another electron may move to the place 
(hole) abandoned by the previous electron. This is equivalent 
to the movement of the positive hole; it appears at a new place, 
left by the second electron. In the external electric field, the 
electrons throughout the mass of the semiconductor move in 
the direction opposite to that of the electric field strength 
(15.1.2). The positive holes move in the direction of the field 
strength, i.e. in the direction that a positive charge would 
travel when subject to an electric field. 

The electrical conduction of a semiconductor that is due to the 
motion of holes is called hole, or p-type, conduction. 
41.10.4 The intrinsic conduction of semiconductors is due to 
the presence of two kinds of current carriers: electrons in the 
conduction band and holes in the valence band. Each electron 
excited to the conduction band corresponds to one hole in the 
valence band. The electron concentration n, and hole concentra- 
tion n, are the same and increase rapidly with the temperature 
T according to the law 


~AEo/2kT 
3 


Ne = np = const e 


- 


where AF, is the activation energy Qf intrinsic conduction 
(41.10.2), and k is Boltzmann’s constant (9.4.5). This explains 
the nature of the dependence of the electrical conductivity and 
resistivity of semiconductors (41.10.2) on the temperature, which 
rone y differs from the dependence of these quantities in 
metals (20.3.7). 

41.10.5 The electrical conduction of semiconductors due to 
the presence in them of impurity centres is called eztrinsic, 
or impurity, conduction. 

Impurity centres (or simply impurities) may be atoms or ions 
of foreign elements, and various defects and distortion of the 
crystal lattice (vacant points, displacements due to deformation 
of the crystal, etc.). 

Impurities change the periodic electric field in a solid and affect 
the motion of the electrons and their energy states. The energy 
levels of the valence electrons of impurity atoms are not located 
in the allowed energy bands (41.9.2) of the main, or host, crys- 
tal, and impurity energy levels (localized levels) are formed in the 


forbidden band. 
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41.10.6 Impurities can be a supplementary source of electrons 
in a solid. For example, in the substitution of one tetravalent 
atom of germanium by an atom of an impurity having five 
valence electrons (phosphorus, antimony or arsenic) one of the 
electrons of the impurity cannot form a covalent bond (39.4.5) 
with the germanium atoms and is thereby “superfluous”. The 
energy levels of such electrons are below the conduction band 
of the host crystal (Fig. 41.15). Such levels, filled with electrons, 
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are called donor levels, and the atoms of the impurity that supply 
the “superfluous” electrons are called donor atoms. Only a small 
amount of energy AF, is required to excite electrons from the 
donor levels to the conduction band. This energy is obtained, 
for instance, from thermal excitation. For silicon AE, = 
= 0.054 eV if the impurity is arsenic. When electrons are trans- 
ferred from the donor levels to the conduction band, electron 
impurity, or extrinsic, conduction (n-type impurity conduction) 
occurs. Such semiconductors are called electron impurity semi- 
conductors (n-type semiconductors). 

41.10.7 Ifa tetravalent atom in the lattice of the semiconductor 
is replaced by a trivalent impurity atom (boron, aluminium or 
indium) there will be a lack of one electron to form saturated 
covalent bonds. The lacking electron can be borrowed from the 
lattice of the neighbouring atom of germanium, in which a posi- 
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tive hole is thereby formed (41.10.3). The consecutive filling 
of the holes formed in the germanium atoms by electrons is 
equivalent to the movement of the holes and leads to conduction 
in the semiconductor. The impurity energy levels, not occupied 
by electrons, are called trapping, or acceptor, levels. In this 
case, the impurity atoms are said to be acceptor atoms. Acceptor 
levels are located above the upper edge of the valence band 
(41.9.4) of the host crystal (Fig. 41.16). When silicon crystals 
are doped with boron (the addition of impurities to a semicond- 
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ductor is called doping), the acceptor levels are an amount 
AE, = 0.08 eV above the valence band. The transfer of elec- 
trons from the filled valence band to the acceptor level produces 
positive holes in this band, and the valence band becomes 
a band of hole conduction. This leads to hole impurity conduction 
(p-type impurity conduction). Such semiconductors are called 
hole impurity semiconductors (p-type semiconductors). 

41.10.8 When a semiconductor is doped with both donor and 
acceptor impurities at the same time, the nature of the conduc- 
tion (n- or p-type) depends upon which impurity produces the 
higher concentration of current carriers. The concentration and 
energy of the electrons (and holes) in semiconductors depend 
very strongly, in contrast to those of metals, on the tempera- 
ture, and increase when the temperature is raised. 
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41.11 Concept of Contact Electrical Phenomena 
in Metals and Semiconductors 


41.41.14 The work function © in removing an electron from a 
metal is the work it is necessary to do for the electron to emerge 
from the metal into vacuum. The work ® is done against the 
attractive forces developed by the surplus positive charge in the 
metal resulting from the removal of the electron. Besides, it is 
necessary to overcome the repulsion forces of the previously 
emerged electrons if they have not been removed and form an 
electron “cloud” at the surface of the conductor. The work 
function has a value of the order of several eV and depends upon 


Fig. 44.47 


the kind of metal and the condition of its surface; it is changed 
by the presence of dirt, traces of moisture, etc. In the quantum 
theory of solids, the work function is reckoned from the upper 
electron-filled Fermi level (44.4.2) (Fig. 44.17). 

41.11.2 Since the fastest clectrons fly out of the metal, a lack. 
of negative charge is formed inthe metallic conductor and 
a surplus in the surrounding space. This is evident in a very 
thin layer (of a thickness equal to several interatomic distances 
in the metal) on both sides of the surface of the conductor. It 
is contended that the surface of the metal is a double electric 
ee oe layer) similar to an extremely thin capacitor 

7.2.4). 

41.11.3 The potential difference AV (16.2.7), characterizing 
the electric field of the double layer, is called the surface poten- 
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tial jump at the metal-vacuum boundary, or the contact potential 
difference between a metal and a vacuum. Thus 


ve. 
e 


where @ is the work function (41.11.1), and e is the magnitude 
of the electron charge. 

There is no electric field outside the double layer, and the po- 
tential outside the metal and beyond the double layer equals 
zero. Inside the metal the potential is positive and equal to AV. 
The potential energy of the conduction electrons (20.3.1) is 
negative and is equal to —eAV = —@. The whole volume of 
the metal is a potential well (38.4.1) for the conduction elec- 
trons; the depth of the well equals the work function ®. 
41.11.4 In the contact of two dissimilar metals, J and 2, 
having the work functions ®, and @, electrons pass predomi- 
nantly from the metal with the lower work function to that with 
the higher one. The metals acquire unlike charges and, in a state 
of thermodynamic equilibrium (8.3.3), there is a potential dif- 
ference AV} between the two contacting dissimilar metals. It is 
called the internal contact potential difference. 

The potential diffetence AV; between two points located close 
to but outside the surfaces of the first and second contacting 
metals is called the external contact potential difference. 

In the contact of the two metals shown in Fig. 41.17 (OD, < ®»), 
the electrons pass predominantly in the direction from metal 7 
lo metal 2 and they become charged positively and negatively, 
respectively. At the same time, a shift occurs in the energy 
levels of the electrons: upward in metal 2 and downward in 
metal J. In a state of thermodynamic equilibrium, the Fermi 
levels (41.4.2) in the two metals coincide (Fig. 44.18). 

It is evident from Fig. 44.18 that the external contact potential 
difference AV;, depends upon the work functions ®, and @,. 
Thus 

AVig= — A z n 


The internal contact potential difference depends upon the 
Fermi levels in the metals. Thus 


Ey — E 
Mig a R : 


e 
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The potential changes from V, to V> within the limits of the 
double electric layer (41.11.2) of thickness / (Fig. 41.19) that 
can be approximately assessed by the formula 


2eoAV ie (in SI units). 

eno 

Here no is the concentration of charges in the double layer, £o 
is the electric constant in SI units (14.2.7), and e is the electron 
charge. 

The thickness of the double layer in contacting metals is of 
the order of the lattice constants in metals, and the resistivity 


Fig. 41.18 Fig. 41.19 


of this layer is the same as that of the rest of the metal. Electric 
current passes equally well in both directions through the con- 
tact layer of two metals, and there is no rectifying (barrier-layer) 
effect of the contact, or junction, i.e. unilateral conduction. 
41.11.5 Contact between a metal and an n-type semiconductor 
(41.10.6) leads to unilateral conduction of the junction. Assume 
that D > Òn, where M, and @®,y are the work functions of the 
metal and the n-type semiconductor, respectively. The arrange- 
ment of the conduction band of the metal (41.9.2), the complete- 
ly filled band of the semiconductor and its donor levels before 
contact has been made are shown in Fig. 41.20. After the metal 
makes contact with the semiconductor, electrons from the donor 
bes of the n-type semiconductor (41.10.6) move into the 
metal. 
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The charge in the contact layer (41.11.2) is positive on the side 
of the n-type semiconductor and negative on the metal side. 
Owing to the low electron concentration in the semiconductor 
in comparison to that of the metal (101° cm instead of 1022 cm~’), 
the thickness of the contact layer at the metal-semiconductor 
boundary is found to be approximately 10% times greater than 
that in the metal. 

41.11.6 The small number of current carriers in the contact 
layer and its large thickness indicate that the resistivity of 
this layer is substantially greater than that of the rest of the 
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semiconductor. Here the contact layer is called the depletion, 
or space-charge, layer (formerly called the barrier, or blocking, 
layer). TheZdepletion layer is the basis, for rectification{of alter- 
nating current at a metal-semiconductor junction. When the 
metal is connected to the positive terminal of a source of elec- 
trical energy, and the semiconductor to the negative terminal, 
the external electric field is directed from the metal toward the 
semiconductor. In this case the electrons are pulled from the 
bulk of the semiconductor into the double layer. The thickness 
of this layer is reduced and the conduction increases. In this 
conducting direction, electric current passes through the metal- 
semiconductor junction. When the external electric field is 
directed from the semiconductor toward the metal, the electrons 
are forced out of the double layer deep into the semiconductor. 
This increases the width and resistance of the depletion layer 
and in this reverse (nonconducting) direction, the metal-semicon- 
ductor junction blocks the electric current. This unilateral 
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conduction of the junction means that an alternating current 
passing through it is rectified. 

41.11.7 The region of contact of two semiconductors having 
different, n- and p-type, conduction is called an electron-hole 


Fig. 44.22 Fig. 44.23 


anne (p-n junction). A double electric layer (41.11.2) is 
formed at a p-n junction as a result of electron and hole motion 
through the interface. Electrons from the n-type semiconductor 
pass into the p-type semiconductor, and the holes pass in the 
opposite direction. This forms surplus charges of opposite sign 
in the ab regions of the n-type semiconductor and the be regions 
of the p-type semiconductor (Fig. 41.21). The double layer of 
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thickness J sets up a contact electric field with the strength Enp 


and a certain potential difference at the boundaries of the layer. 
This field impedes further counter migration of electrons and 
holes. At a definite thickness of the p-n junction a state of 
equilibrium is set up, corresponding to the equalizing of the 
Fermi levels in the two semiconductors. This forms an equilibri- 
um contact layer, which is a barrier layer (41.14.6), having 
higher resistance than that of the remaining regions of the 
semiconductors. 

41.11.8 When a p-n semiconductor junction is connected into 
the external circuit of a source of electrical energy in the way 
shown in Fig. 41.22, the external (applied) electric field, strength- 
ening the field of the contact layer, causes electrons to migrate 
to the n-type semiconductor and holes to the p-type semiconduc- 
tor at opposite sides of the junction. The thickness of the barrier 
layer and its resistance increase. This direction of the applied 
electric field is said to be the reverse one. In this direction practi- 
cally no current passes through the p-n junction. When the polar- 
ity of the external applied voltage is reversed (Fig. 41.23), 
the external electric field with the strength Eext opposes the 
field of the contact layer (41.11.7). The counter motion of elec- 
trons and holes, forced by the external field to move from the 
bulk of the semiconductors to the region of the p-n junction, 
increases the number of mobile current carriers at the junction. 
This reduces the thickness and resistance of the contact layer, 
and, in this forward direction, current passes through the p-n 
junction. The blocking effect of a p-n junction is similar to the 
rectification of a two-electrode valve, or tube, called a diode. 
Accordingly, a semiconductor device containing one p-n junc- 
tion is called a semiconductor diode. Crystal triodes, or transistors, 
contain two p-n junctions. 
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PART EIGHT NUCLEAR PHYSICS 
AND ELEMENTARY PARTICLES 


CHAPTER 42 STRUCTURE AND BASIC 
PROPERTIES OF ATOMIC NUCLEI 


42.1 Main Properties and Structure of the Nucleus 


42.4.1 The nucleus is the central part of an atom; practically 
all of the atom’s mass and its positive charge are concentrated 
in the nucleus. All atomic nuclei are made up of elementary 
particles (43.1.1) called protons and neutrons, considered to be 
two charge states of a single particle called the nucleon. The 
proton has a positive electric charge, equal in magnitude to that 
of the electron (14.1.2). The neutron has no electric charge. 
42.1.2 The charge of the nucleus is the quantity Ze, where e is 
the electron charge, and Z is the atomic number of the chemical 
element in Mendeleev’s periodic table (39.3.5) and is equal to 
the number of protons in the nucleus. Up to date nuclei are 
known with Z ranging from Z = 1 to Z = 107. For all nuclei, 
with the exception of {H, ?He and certain other neutron-defi- 
cient nuclei, N > Z, where N is the number of neutrons in the 
nucleus. For light nuclei N/Z ~ 1; for the nuclei of chemical 
elements at the end of the periodic system, N/Z ~ 1.6. 
42.1.3 The number of nucleons (42.1.1) in a nucleus, A = N + 
+ Z, is called the mass, or nucleon, number. Nucleons (proton 
and neutron) are assigned a mass number equal to unity. The 
electron has zero mass number. | 
Nuclei with the same Z value, but different A values are called 
isotopes. Nuclei with the same A value, but ‘different Z values 
are called isobars. The nucleus of chemical element X is denoted 


by the symbol ox; where X is the symbol of the chemical ele- 


ment. 

About 300 stable isotopes of chemical elements are found in 
nature, and there are over a 1000 man-made (radioactive) 
isotopes. 
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42.1.4 The size of a nucleus is specified by the nuclear radius r, 
which has only an arbitrary meaning because the nucleus has 
no clearly defined boundary. The empirical formula for the 
nuclear radius is r = r,A¥/3, where rọ = (4.3 to 1.7) 10-1 m 
can be interpreted as the proportionality of the volume of the 
nucleus to the number of nucleons it contains. 

The density of nuclear matter is of the order of magnitude of 
1017 kg/m? and is constant for all nuclei. It substantially exceeds 
the density of the most dense of ordinary substances. 

42.1.5 Nucleons in atomic nuclei are fermions (41.2.4) and 
their spin equals #/2. The nucleus of an atom has intrinsic 
angular momentum, called nuclear spin, equal to Ly = 
= VI (I -|- 1) it, where Z is the inner (total) spin quantum nu 
(cf. Sect. 26.1.3), A:-h/27, and h is Planck’s constant (Appen- 
dix IT). 

The ae I may assume any whole or half-integer\value: 
0, 1/2, 1, 3/2, 2, etc. Nuclei with even mass numbers Æ have 
integer spin (in A units) and comply with Bose-Finstein‘s 
tistics (44.2.1). Nuclei with odd mass numbers A have half- 
integer spin (in # units) and comply with Fermi-Dirac sta- 
tistics (44.2.4). 

42.1.6 Nuclear particles have their intrinsic magnetic moments, 
which determine the total nuclear magnetic moment Pmn. The 
unit of measurement of nuclear magnetic moments is the nuclear 
magneton uy analogous to Bohr’s magneton (26.1.4). Thus 


eh 
=——— (in § it 
UN Imp (in SI units), 


eh 
2M pc 


UN = (in Gaussian units), 


where e is the magnitude of the electron charge, mp is the mass 
of the proton, and c is the electrodynamic constant (Appendix IT). 
The nuclear magneton is a factor of mp/m,. = 1836.5 times less 
than the Bohr magneton. It follows that the magnetic proper- 
ties of atoms are determined by those of their electrons. 

The following relation exists between nuclear spin Ly, measured 
in A units, and the magnetic moment of the nucleus, measured 
in nuclear magnetons: 


Pun = YNEN, 
LO* 
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where yy is the nuclear gyromagnelic ratio (cf. Sect. 26.4.4). 
The magnetic moments of the neutron and proton are associated 
with their complex internal structure (43.6.2). The neutron 
has a negative magnetic moment equal to —(1.91314 + 
+ 0.00005) un. The spin of the neutron and its magnetic mo- 
ment are opposite in direction. The magnetic moment of the 
proton is positive and is equal to (2.79277 -|- 0.00003) uyn. 
Its direction coincides with the direction of proton spin. 
42.1.7 In the general case, the distribution of clectric charge 
of the proton in the nucleus is asymmetrical. A measure of the 
deviation of this distribution from spherical symmetry is the 
electric quadrupole moment of the nucleus Q. If the charge density 
is assumed the same throughout the nucleus, Q is determined 
only by the shape of the nucleus. For an ellipsoid of revolu- 
tion 

2 
Q = er Ze (b2—a?), 


where b is the semiaxis of the ellipsoid along its spin, and a 
is the semiaxis in the perpendicular direction. For a nucleus 
stretched out along its spin, b >a and Q > 0. For a nucleus 
flattened in this direction, b< a and Q < 0. For spherical 
distribution of the charge in the nucleus, b = a and Q = 0. 
This is valid for nuclei with spin equal to 0 or to (4/2) À. 


42.2 Binding Energy of Nuclei. Mass Defect 


42.2.1 In nuclei, nucleons are in states that differ essentially 
from their free states. With the exception of the nucleus of ordi- 
nary hydrogen, all nuclei contain at least two nucleons, between 
which there exists a special nuclear strong interaction—at- 
traction—that ensures stability of the nucleus notwithstanding 
the repulsion of the protons of like charge. 

42.2.2 The binding energy of a nucleon in a nucleus is the phys- 
ical quantity equal to the work required to remove the nuclecn 
from the nucleus without imparting kinetic energy to it. 

The nuclear binding energy is the amount of work required to 
take the nucleus apart into its component nucleons without 
imparting any kinetic energy to them. It follows from the law 
of energy conservation that in forming a nucleus the same amount 
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of energy is released as is required in splitting the nucleus 
into its component nucleons. The nuclear binding energy is the 
difference between the total energy of all the free nucleons that 
make up the nucleus and their total energy in the nucleus. 
42.2.3 In forming a nucleus, its mass is reduced: the mass of the 
nucleus is less than the total mass of its component nucleons. 
This reduction in mass in the formation of a nucleus is due to 
the release of the binding energy. If AEp is the amount of energy 
released in forming a nucleus, the corresponding mass Am 
(5.7.6), equal to 

FAEy 
m=; 


? 


is called the mass defect and characterizes the reduction in the 
total mass in forming a nucleus from its component nucleons. 
If a nucleus of mass my is formed of Z\protons with the mass mp 
and of (A — Z) neutrons of mass m,, then 


Am = Zmy + (A — Z) m — mn. 

Instead of the mass my of the nucleus, the quantity Am can be 
expressed in terms of the atomic mass m,. Thus 

Am = Zmy + (A = Z) m — ma, 


where my is the mass of the hydrogen atom. 

In practice, in calculating Am, the masses of all the particles 
and atoms are expressed in atomic mass units (Appendix II). 
The mass defect serves as a measure of the nuclear binding ener- 
gy. Thus 


AEy = Amc? = [Zmp + (A — Z) m — mn] c?. 

One atomic mass unit corresponds to one atomic unit of energy, 
which is equal to 931.5016 MeV. 

42.2.4 The specific binding energy AE}" of the nucleus is the 
binding energy per nucleon: AE" = AEy/A. The quantity 
AE}," equals 8 MeV/nucleon on an average. A curve showing 
the dependence of the specific binding energy on the mass num- 
ber A and characterizing the different bonding strengths for 
the nucleons in the nuclei of various chemical elements, is 


given in Fig. 42.4. The nuclei of elements in the middle of the 
periodic system (39.3.5) (28 < A < 138), i.e. from 7§Si to 
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138Ba have the strongest bonds. In these nuclei AE," is close 


to 8.7 MeV/nucleon. As the number of nucleons in the nucleus 
increases, the specific binding energy decreases. The atomic 
nuclei of chemical elements located at the end of the periodic 
system (for instance, the nucleus of uranium) have the binding 


_ Specific binding energy Eg, MeV /nucleon 


g 4Q 80 120 160 200 240. 
Mass number 


Fig. 42.4 


energy AE?" ~ 7.6 MeV/nucleon. This explains the feasibility 


of releasing energy by the fission of heavy nuclei (42.9.2). In 
the region of low mass numbers there are sharp peaks of specific 
binding energy. Maxima are typical for nuclei with even num- 
bers of protons and neutrons ($He, 12C, #80). Minima are typical 
for nuclei with odd numbers of protons and neutrons (Li, ,B, ,N). 
42.2.5 Data on the binding energy of the nucleus and an appli- 
cation of the liquid-drop model of the nucleus* enabled certain 
laws of nuclear structure to be established. 


* The liquid-drop model of the nucleus is discussed in the phys- 
ics course of secondary schools. 
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The atomic nuclear stability criterion is the relation between the 
numbers of protons and neutrons in a stable nucleus for the 
given isobars (42.1.3) (i.e. A = const). The condition for mi- 
nimum energy of the nucleus yields the following relation be- 
tween Z,, and A: 


A 


Zst = T980.015A2E * 


The stable atomic Z 4 are the nearest whole numbers 


to those obtained by this fotnula. 

At low and medium values of X there are approximately equal 
numbers of protons and neutrons in stable nuclei: N ~ Z or 
A ~ 2Z. Especially stable are nuclei (when Z = A/2) for 
which N or Z = 2, 8, 20, 28, 50, 82 or 126. These have been 
called magic numbers. Nuclei in which both N and Z are magic 
numbers are said to be double-magic. 

As Z increases, the Coulomb forces of repulsion between the 
protons increases in proportion to Z (Z — 1) ~ Z? (pairwise 
proton interaction), and, to compensate for this repulsion 
by nuclear attraction, the number of neutrons must increase 
more rapidly than,the number of protons. 


42.3 Nuclear Forces 


42.3.1 Nuclear interaction indicates that special forces exist 
in nuclei that do not belong to any of the types of forces known 
m classical physics (e.g. gravitational and electromagnetic 
orces). 

42.3.2 Nuclear forces are of short-range action. They only act 
at the extremely short distances, of the order of 10-15 m, between 
nucleons in a nucleus. This distance, ranging from 1.5 to 2.2 X 
x 10-15 m, is called the action radius, or range, of the nuclear 
forces. 

42.3.3 Nuclear forces display charge independence: the attrac- 
tion between any two nucleons is the same and is independent of 
the charge state of the nucleons, regardless of whether they 
are protons or neutrons (42.1.1). The charge independence of 
nuclear forces follows from a comparison of the binding energy 
in mirror nuclei. These are nuclei having the same total number 
of nucleons, but the number of protons in one is equal to the 
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number of neutrons in the other. The nuclei of helium 3He and 
tritium ?T (superheavy hydrogen) can be taken as an example. 
The binding energies (42.2.4) in these nuclei are 7.72 MeV and 
8.49 MeV, respectively. Assuming charge independence of nuclear 
forces, the difference in the nuclear binding energies, equal 
to 0.77 MeV, is associated with the energy of Coulomb repulsion 
between two protons in the nucleus of 3He. Assuming this quan- 
tity equal to e?/4meyr (16.2.6), the average distance r between 
the protons in the 3He nucleus is found to equal 1.9 X 10-15 m. 
a is in agreement with the action radius of nuclear forces 
(42.3.2). 

42.3.4 Nuclear forces have the property of saturation. This 
is manifested in the fact that a nucleon in the nucleus interacts 
with only a limited number of the closest nucleons among its 
neighbours. Besides the short-range action of nuclear forces 
(42.3.1), this saturation also follows from the linear dependence 
of the binding energy in nuclei on the mass number A (42.1.3). 
If each nucleon in a nucleus interacted simultaneously with 
all (A — 1) nucleons in the nucleus, the binding energy of the 
nucleus would be proportional to the possible number of inter- 
acting pairs of nucleons in the nucleus, i.e. the number of 
combinations of A taken two at a time: A (A — 1)/2*. The 
dependence, in this case, of the binding energy on A would 
be quadratic, rather than linear, and this is in contradiction with 
the experimental data. Practically total saturation of nuclear 
forces is reached for the alpha particle (42.4.2), which is a stable 
structure. 

42.3.5 Nuclear forces depend upon the orientation of the spins 
of the interacting nucleons. This is confirmed by the different 
kinds of scattering of neutrons by the molecules of ortho- and 
parahydrogen. In a molecule of orthohydrogen, the spins of 
the two protons are parallel to each other; in a molecule of para- 
hydrogen they are antiparallel. If the interaction of a neutron 
with a proton was independent of the mutual orientation of 
their spins, the scattering of neutrons by molecules of either 
orthohydrogen and parahydrogen would be the same, even though 
in the case of parahydrogen the spin of the neutron is paral- 
lel ‘to that of one of the protons, and in the case of orthohydro- 
gen the spin of the neutron does not necessarily coincide with 
the spins of the protons. Experiments showed that the scattering 


* Assuming, of course, pairwise nuelear interaction. 
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of neutrons by parahydrogen is 30 times more than the scattering 
by orthohydrogen. This is a clear indication of the dependence 
of nuclear forces on the orientations of the spins of the interact- 
ing nucleons. 
Nuclear forces are not central forces (3.3.4). They cannot be 
represented as ones acting along lines joining the centres of the 
interacting nucleons. 

42.3.69No rigorous theory exists that can comprehensively 
explain the nature and laws of nuclear forces. One of the simplest 
theories is the exchange interpretation of nuclear forces, which 
are also called exchange forces, or forces arising in an exchange 
interaction. This theory is based on the idea that interaction be- 
tween two particles is accomplished by the exchange of a third 
particle. The exchange theory of nuclear forces assumes that the 
interaction between the nucleons of a nucleus is due to the ex- 
change of special particles called pions (43.3.4) and having the 
rest mass m,. The time Aż during which nucleons exchange pions 
(i.e. the time interval from the emission of a pion by one nucleon 
and its absorption by another), should not, according to the 
Heisenberg relation (38.6.5), exceed At > A/AE, where AE 
is the indeterminacy in the value of the nucleon energy. The 
quantity AF is related to the rest energy E,, of the new particle 
created in the region of the action radius R of the nuclear forces 
(42.3.2). Assuming that AZ = E, = m,c?, At ~ lm c. If 
it is assumed that the pion leaves the nucleon at the velocity 
vz, © c, it travels, during the time At, the distance 


h h 
MC MC 


It is quite natural to assume further that the pion travels a dis- 
tance equal to R during the time At, ie. RZ, = R x 1.5 X 
<x 10-45 m (42.3.2). Consequently, the mass of the pion my is 
equal to m, = A/R e ~ 250 me, where me is the rest mass of 
the electron. This agrees well with experimental data on the 
masses of pions (see Table 43.1). 


42.4 Radioactivity 


42.4.1 Radioactivity is the transformation of unstable isotopes 
(42.1.3) of one chemical element into isotopes of another element, 
accompanied by the emission of certain particles. Radioactivity 


634. 


TABLE 42¢1 


Radivactivity 


Alpha decay 


Beta decay 


B- decay 
Bt decay 


Electron cap- 
ture (e or K 
capture) 


Spontaneous 
fission 


Proton radio- 
activity 
Double proton 


radioactivity 
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Change in | Change in 
nuclear mass 
charge Z number A 
Z=2 A--4 
Za1 A 
Z+-1 A 
Z--1 A 
Z—1 A 


Z—(1/2) Z | A—(41/2) A 


Z—1 A-—1 
2—2 A--2 
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Essence of the process 


Emission of an alpha 
particle, which is a 
system of two pro- 
tons and two neutrons 
bound together. 


| Mutual conversions in 


the nucleus of a neu- 
tron ({n) intoa proton 
tp) and vice versa 
in —> p- (_fe4- 8v,) 
tp > in + Gte- 8vo) 
Ip + -? e > bn - (ove), 
where ĉłvẹe and Ive 
are the electron neu- 


trino and  antineu- 
trino. 
Particles emitted 


from the nucleus are 
enclosed in parenthc- 
ses. 


Fission of the nucleus. 
into two fragments 
of approximately 
equal mass and charge. 


Emission of a proton 
from the nucleus. 


Simultaneous emis- 
sion of two protons 
from the nucleus. 
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is also the mutual conversion of certain elementary particles 
into other ones (for instance, the conversion of a neutron into 
a proton with the creation of an electron and an electron anti- 
neutrino) (42.64). 

Natural radigactivity is that observed in the unstable isotopes 
that exist in\nature. 

Artificial, or induced, radioactivity is that acquired by isotopes 
as a result of nuclear reactions (42.9.1). 

42.4.2 The main kinds of radioactivity are listed in Table 42.1. 
42.4.3 Usually, all kinds of radioactivity are accompanied by 
the emission of gamma rays, which is hard, short-wave electro- 
magnetic radiation. Gamma rays are the main form that reduces 
the energy of excited products of radioactive transformations. 
A nucleus subject to radioactive decay is called a parent nu- 
cleus; the result of decay is called a daughter, or product, nucleus 
and, as a rule, is found to be excited. Its transition to the ground 
state is accompanied by the emission of a y photon. 

42.4.4 Spontancous decay of atomic nuclei obeys the radioac- 
tive decay law: 


N = Nye™, 


where N, is the number of nuclei in the given volume at the 
initial instant of time ¢ = 0, N is the number of nuclei in the 
same volume at the instant of time t, and A is the decay constant, 
which is the probability that the nucleus will decay in one 
second and is equal to the fraction of the nuclei that decays in 
unit time. 

The law of spontaneous radioactive decay is based on two assump- 
tions: (1) the decay constant is independent of external con- 
ditions, and (2) the number of nuclei, which decay during the 
time dt, is proportional to the number of nuclei present. These 
assumptions signify that radioactive decay is a statistical pro- 
cess and the decay of any given nucleus is a chance event, having 
a definite probability. The quantity 1/A is the average life (or 
lifetime) of a radioactive isotope. As a matter of fact, the total 
lifetime dN of a nucleus is equal to t| dN | == iAN di. The 
average life t of all initially existing nuclei is 

o0 (6,0) 


— 1 ( any ae a aa 
T= A di = >. 
0 
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42.4.5 A characteristic of nuclear stability against decay is the 
radioactive half-life (half-value period) 7,/,.. ‘This is the length 
of time after which half of the initial number of nuclei of the 
given radioactive substance undergoes decay. The relation 
between A and T3/2: 


In2 0.693 
Ti = N = À 


42.4.6 Natural radioactivity (42.4.1) is observed for the nuclei 
of atoms of chemical elements that are located beyond lead in 
Mendeleey’s periodic system (39.3.5). Natural radioactivity 
of light and medium-weight nuclei is observed only for the 
nuclei 42K, 83Rb, 442In, 188La, 425m, '28Lu and '82Re. 

The law of the conservation of electric charges (14.1.3) in nu- 
clear radioactive decay is 


Zye = Y Zie, 
= 

where Zye is the charge of the parent nucleus (42.4.3), and Z;e 
is the charge of the nuclei and particles created as a result of 
the radioactive decay. This law is also applicable in investi- 
gating all nuclear reactions (42.9.1). The rule for the conser- 
vation of the mass numbers (42.1.3) in natural radioactive phe- 
nomena is 


= 0.693T. 


where Ay is the mass number of the parent nucleus, and 4; 
is the mass number of the nucleus or particles obtained as a 


result of radioactive decay. 
42.4.7 Soddy’s displacement (lajans-Soddy) law for radioactive 
a and B- decays: 


ZX- 4G 4Y + 4He for œ decay 
ZX 7 4Y4+_Ye for p- decay. 


Here ok is the parent nucleus, Y is the symbol for the daughter 


nucleus, $He is a helium nucleus, _%e is the symbol for designat- 
ing an electron, for which A = 0 and Z = —1._ l 
If the daughter nucleus is also radioactive, a radioactive series, 
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or chain, of transformations is formed. Natural radioactive 
nuclei constitute three radioactive families, called the uranium 
series (#88U), thorium series (?32Th) and the actinium series 
(735 Ac). They are named after the longest-lived parent isotope 
of the series with the maximum half-life (42.4.5). After under- 
going a sequence of a and B- decays, all the families terminate 
in one of the stable nuclei of lead isotopes: ?8$Pb, 288Pb and 
20°Pb. The neptunium series, beginning with the transuranium 
element neptunium 2323Np, is obtained artificially (being a man- 
made element) and terminates in bismuth 228Bi. 

42.4.8 If there is a whole series of radioactive ‘decays and, during 
the time dt, ApNp dt nuclei decay from the total number N 

of parent nucle}, whereas AgNq dt daughter nuclei decay during 
the same time, the total change dN q in unit time in the number 
of nuclei of the daughter substance is expressed by the relation 


dNq 
dt 


In the case of dynamic equilibrium between the parent and daugh- 
ter substances dN ,/di = 0, and the conditions of radioactivity 
equilibrium are complied ‘with. Thus 


from which 
eo ha _ Tp 
= A Ty 
a. Tp and Tq are the half-lives (42.3.5) of the parent and 
daughter nuclei. The product A = AWN is called the activity 
of the given radioactive substance and represents the number 
of decays in unit time. Activity referred to unit mass of the 
substance is called specific activity. Activity is measured by 
the number of nuclei of a radioactive substance that decays 


in unit time (s~). The unit of activity is called the becquerel 
(Bq). 


= MpNp—AaNa- 


? 


42.5 Alpha Decay 


42.5.1 Alpha decay is the emission of alpha particles by the 
nuclei of certain chemical elements. Alpha decay (42.5.2) is 
a property of heavy nuclei having mass numbers A > 200 
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and nuclear charges Ze > 82. Formed inside such nuclei are 
separate alpha particles, each consisting of two protons and 
two neutrons. This is facilitated by the saturation of nuclear 
forces (42.3.4). The formed alpha particles are subject to more 
Coulomb force repulsion from the protons of the nucleus than 
single protons are. At the same time, an alpha particle is sub- 
ject to less nuclear attraction to the nucleons in the nucleus than 
single nucleons. 

42.5.2 The nucleus is a potential barrier (38.7.1) for alpha par- 
ticles and its height U is greater than Æ, the energy of the alpha 
particle in the nucleus. Alpha decay occurs by the penetration 
of alpha particles through the potential barrier by means of 
the tunnel effect (38.7.2). The formula for the penetrability D 
of the potential barrier indicates that even negligible changes 
in the energy of an alpha particle in the nucleus lead to a drastic 
change of the quantity 


exp { —+ j V ma [UGE] ae} 
x4 


This explains the great differences in the half-lives of alpha 
radiators, from 10° years to 41077 s, at a comparatively small 
increase in the energy of the alpha particles, from 4 to 9 MeV. 
42.5.3 The decay constant à (42.4.4) is related to the penetra- 
bility D of the potential barrier for alpha particles. For a sim- 
plified model of a potential barrier of length Z (38.7.2) 


à = Dn, 
where n is the number of collisions of the alpha particle with 


the barrier wall in unit time, equal to n = v/2L, and v = 


= y 2E/ Ma is the velocity of the alpha particle in the nucleus. 
The quantity L is commonly taken equal to the radius R of 
the nucleus (2R is the width of the potential barrier). 

The simplified formula for the alpha-radioactive decay constant 


1S 


ee ee 2E 2 — 
A = OR Wee -4 V 2ma (Uo— E) R]. 


42.5.4 The Geiger-Nuttall rule, or relation, states that the 
larger the decay constant A of a radioactive element, the longer 
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the range R, (path length) of the alpha particles it emits in 
air. Thus 


Ink =A + Bin Rg, 


where A and B are empirical constants, having different values 
for the various radioactive families (42.4.7). 

42.5.5 On the basis of data, which is given in Sects. 42.5.3 
and 42.5.4, it has been shown that the same alpha-radioactive 
element has several groups of alpha particles with different 
ranges (path lengths). Constancy of the ranges is observed 
within each group. It follows that the alpha particles emitted 
by the nuclei have a definite energy spectrum and, conseguently, 
the atomic nuclei have discrete energy levels. 


42.6 Beta Decay 


42.6.1 The concept of beta decay unites three kinds of nuclear 
transformations: electron decay (B~-), positron decay (B+) and 
electron capture (e or K capture). The first two kinds of trans- 
formations consist in the emission by the nucleus of an electron 
(or positron) and an electron antineutrino (or electron neutrino). 
These processes involve the transformation of one kind of nu- 
cleon in the nucleus into another: a neutron into a proton or a 
proton into a neutron. The transformations occur as follows: 


In —> {p+ _%e-+ve (P decay), 
ip—> an + 42e lve (Bt decay). 


Here jn and ip are the symbols for the neutron and proton, łe 
and ,%e are the symbols for the electron and positron, and $v, 
and We are the symbols for the electron neutrino and antineu- 
trino (42.4.2). 

In the case of e capture, the proton is transformed into a neutron 
as follows: 


1P H-1 > on H §Ve- 


This procedure consists in the disappearance of one of the elec- 
trons from the K shell of the atom, which is closest to the nu- 
cleus. The proton, in being transformed into a neutron, seems 
to capture an electron. This gave rise to the term electron cap- 
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ture (or e capture). A feature of this kind of beta decay is the 
emission from the nucleus of only a single particle, the neutrino 
8v.. An example of e capture is the transformation of the radio- 
active nucleus of beryllium jBe into a stable nucleus of lithium 


2Li. In contrast to B+ decay, electron capture is accompanied by 
characteristic X radiation, belonging to the K line of the cor- 
responding element (39.7.9). 

42.6.2 B- decay occurs in both natural and artifical radioactive 
nuclei; B+ decay is typical of only artificial radioactivity, i.e. 
the initiation of intrinsic radioactive radiation of nuclei due 
to the action of alpha particles, neutrons or other particles. 
This violates the conditions of stability of the atomic nucleus 
(42.2.5). For example, the artificially radioactive isotope of 
carbon ł4G is obtained from the stable nucleus of nitrogen by 
the action of neutrons and with the emission of protons: 


MN gn '§C-+ fp 


and, after undergoing beta decay, is again transformed into 
the stable isotope #4N. Thus 


"$C — 13N + -16 + Ve. 


Violation of the stability conditions (42.2.5) by introducing 
surplus protons into the nucleus leads to artificial B+ decay. 
This is evident from the following example: 


B+ $He > 14N > N+ $n, 
1¢ 
IN -> 18C- e+ fve, 


where ĉe is a positron and $v, is an electron neutrino. 

42.6.3 Natural B- decay takes place so that the neutron jn 
is spontaneously transformed into a proton. The rest energy 
(5.7.3) of the neutron exceeds that of a hydrogen atom (i.e. 
the proton and electron taken together) by 782 keV. Hence, 
a transformation of the type (42.6.1) 


in > {p+ -feie 
is feasible with respect to energy outside the nucleus as well. 


The radioactive decay of free neutrons has been observed in high- 
intensity streams of neutrons released in nuclear reactors. 
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This decay takes place with a half-life (42.4.5) of (4.01 + 
-- 0.03) x 10% s. In heavy nuclei, overloaded with neutrons, 
such transformations lead to B- natural radioactivity. 
Transformations (42.6.1) of the type tp — jn + fe + $v, are 
only possible in nuclei where the required energy is borrowed 
from neighbouring particles. This transformation leads to 
artificial B+ decay. | 

42.6.4 The half-lives of beta decays (42.4.5) vary for various 


sources of B= radioactive radiation in an extensive time range 
from 10-2 s to 1012 years, which is incommensurably long com- 
pared to nuclear time (10-22 to 10-233 s) (43.2.6). This indicates 
that beta decay is duc to weak interaction (43.2.8). 

42.6.5 The decisive experimental fact in understanding the 
mechanism of B- decay and establishing a suitable theory was 


aN 
de 


‘B spectrum limit 


a Emar E, MeV 
Fig. 42.2 


an investigation of the energy spectrum of the emitted electrons. 
This spectrum was found to be continuous, extending to E = 
= Emax (Fig. 42.2). The energy Emax is called the upper energy 
limit of the P- spectrum; it is a characteristic of the source of 
P- radioactive radiation. Electrons with an energy exceeding 
Emax cannot be obtained with the given source. 

42.6.6 To reconcile the continuity of the clectron energy spec- 
trum with the discrete nature of the energy levels of the nucleus 
(42.5.5), it is necessary to assume that, together with the elec- 


4£1--0186 
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tron _{e, another particle, the electron autineutrino* $v,, is 
emitted from the nucleus. The total energy lost by the nucleus 
in a beta decay is equal to Emay, but it is variously distributed 
between the electron and electron antineutrino**. In particular, 
the limit point of the curve in Fig. 42.2 means that all the energy 
of the B- decay is carried away by the electron. Zero energy of 
the electron on the curve would correspond to all the energy 


being carried away by the antineutrino ĝve. 

For the P- radioactivity of free neutrons Emax = 782 keV, 
which fully corresponds to that discussed in Sect. 42.6.3. 
42.6.7 Neither the mass number A nor the spin of the nucleus 
(42.1.5) changes in a beta decay. The conservation of spin is 
explained by the fact that the spin of the electron antineutrino 
is h/2 (see Table 43.1), and the combined spins of this particle 
and of the electron, also equal to ii/2, are equal to zero when 
the spins of the two particles are oppositely oriented. 


42.7 Gamma Rays 


42.7.4 Gamma rays constitute hard electromagnetic radiation 
whose energy is released in nuclear transitions from excited 
energy states to the ground or less excited states (42.5.5), as 
well as in nuclear reactions. In the first case, according to the 
Bohr frequency condition (39.1.6), the energy of a photon of 
gamma rays is equal to the difference in the energies of the 
final and initial energy levels of the nucleus: 


ivi, = E; — Ey = AE, 


where v;, is the frequency of the photon corresponding to the 
transition of the nucleus from the state with energy E; to the 
state with energy Ep. The magnitude of AF;, is of the order of 
0.4 MeV and considerably exceeds the difference of the ener- 
gies of the electron levels in the atom (39.1.5). Gamma rays are 
extremely short-wave electromagnetic radiation of a wavelength 


not exceeding 10-2 nm, i.e. 0.1 A. The discrete line spectrum of 


* Antiparticles are discussed in Sect. 43.5.4. 
** It is evident that Æ max determines the diflerence AF; in 
the energies of two levels of the nucleus. 
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gamma rays is confirmation of the aforesaid (see Sect. 42.7.4). 
42.7.2 Gamma radiation is not an independent type of radio- 
activity (42.4.2). It accompanies the processes of œ and B de- 
cays and causes no changes in the charge or mass number of 
nuclei. It has been established that gamma rays are emitted 
by a daughter nucleus (42.4.3) which turns out to be excited at 
the instant it is formed (42.7.4). The excited nucleus loses its 
energy during a time ranging from 10-35 to 10-14 s, which is 
considerably less than the lifetime of the excited atom (approx. 
10-8 s). 

42.7.3 he origin and properties of gamma rays are confirmed 
by the laws of internal conversion of gamma rays, which consists 
in photoelectric-effect phenomena (37.1.1) of the electrons of 
the atom’s inner shells as a result of gamma radiation of its 
nucleus. Electrons formed by such an internal photoelectric 
effect are said to be conversion electrons. 

In many cases all the energy of the gamma rays is expended on 
internal conversion phenomena, and only conversion electrons 
are observed instead of gamma radiation. The energy eV, of 
a conversion photoelectron is related to the energy hv of a pho- 
ton of gamma rays by Einstein's photoelectric emission equa- 
tion (37.1.4): r 


eVn = hv — Dp, 


where @,, is the work function of an electron from the nth elec- 
tron shell of'the atom, numerically equal to the energies of 
electrons located at definite energy levels in the atom (39.1.5). 
These energies are known from data on characteristic X-ray 
spectra of atoms (39.7.9). A gamma photon of the energy hv 
can remove an electron from any inner shell (K, M, etc.) of 
the atom (39.3.6). Accordingly, the energies eV n of the conversion 
electrons are expressed by 


eV = hy —- Pr, eVarp =hv—ODy, eVy = hv — Dy, ete. 


where Pr, Dy, Py, etc. are the X ray energy levels of the 
atom. 

42.7.4 The result of internal conversion (42.7.3) is the loss of 
electrons from the inner shells of the atom and, consequently, 
the provision of conditions for producing the X ray character- 
istic spectrum (39.7.9). Internal conversion is accompanied by 
the emission of characteristic X rays. A measurement of the 


4l 
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energy of the conversed electrons and the data on the quanti- 
ties Dz, Pm, Dy, etc. (42.7.3) enabled it to be shown that the 
nucleus can emit a definite series of monochromatic gamma rays, 
i.e. that gamma rays have a line spectrum. Besides, these data 
showed that gamma rays are emitted by a daughter nucleus 
rather than a parent one. 

42.7.5 Gamma rays have a very strong effect on matter, in 
particular, on biological substances. The effect of gamma rays 
and other kinds of ionizing radiation is evaluated by the absor- 
bed radiation dose D, the ratio of the energy of radiation to the 
mass of the irradiated matter. The unit of the dose is J/kg, which 
is the dose of absorbed radiation at which a mass of 1 kg of irra- 
diated matter receives 1 joule of energy of ionizing radiation*. 
This unit is called the gray (Gy). Also used is the off-system 
unit called the rad: 1 rad = 10-2 Gy. 

The dose rate N of radiation is the absorbed dose D referred to 
unit time. Thus 


N = D/t. 


The unit of dose rate is a watt per kilogram: W/kg = Gy/s. 
42.7.6 The energy characteristic of radiation, evaluated on 
the basis of the ionization of dry atmospheric air, is the exposure 
dose De. It is measured in coulombs per kilogram (C/kg). One 
coulomb per kilogram of exposure dose of X or gamma radiation 
is the amount at which the sum of the electric charges of ions 
of the same sign, produced by the electrons released by irradia- 
ted air of 1 kg mass, fully utilizing the ionizing capacity, is 
equal to one coulomb. An off-system unit of exposure dose is the 
roentgen R, with 1 R = 2.58 x 10-4 C/kg. 

With an exposure dose equal to 1 R, a total charge of ions of 
the same charge equal to (4/3) X 10-9 C is obtained in 10-6 m? 
of dry air at standard atmospheric pressure. 

The exposure dose rate N, is the exposure dose per unit time: 
Ne = Det, measured in amperes per kilogram (A/kg). One 
unit of exposure dose rate is the electromagnetic radiation at 
which the exposure dose increases by one coulomb per kilogram 
in one second. The off-system units of the exposure dose rate 


* Owing to the specific nature of information on the biological 
effect of gamma rays, the pertinent units of physical quantitics 
are given here instead of in the Appendix. 
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are: 1 R/s = 2.58 x 10-4 A/kg, 4 R/min = 4.30 x 10° A/kg 
and 4 R/h = 7.17 X 10-8 A/kg. 

42.7.7 A radiation dose can also be evaluated on the basis of 
its biological effect. The unit applied for this purpose is called 
the roentgen equivalent man (rem). This is the absorbed energy 
of radiation that produces the same biological effect as that pro- 
duced by one roentgen. Thus 


1 rem = 10-2 J/kg. 


The safe dose rate for humans is considered to be one about 
250 times that produced by the background of space and by radio- 
active radiation from the earth’s interior. 


42.8 Mossbauer Effect 


42.8.1 All excited energy levels of the nucleus have energy 
values determined to an accuracy of AZ, which is determined 
from the indeterminacy relation (38.6.6). Thus 


h 
BERERE 


where Aż is the lifetime of the nucleus in the excited state. 
Only for the ground state of a stable nucleus is At == œ and 
AE = 0, i.e. the energy value of the nucleus is exactly equal 
to E*. For example, a nucleus of iridium !%!Ir goes over from 
the excited state with the energy Æ = 129 keV to the ground 
state, emitting a gamma photon, during the time At, which 
can be taken equal to the half-life (42.4.5) 7 = 10-1! s. The 
energy indeterminacy AE is found to equal AE x 5 x 10-8 eV. 
The finite lifetime of the excited energy states of the nucleus 
leads to a lack of monochromaticity of the gamma radiation 
that accompanied the transition of the nucleus from the excited 
to the ground state. This lack of monochromaticity is called the 
natural linewidth of gamma radiation, and the indeterminacy 


* This is equally true for the energy levels of electrons in atoms. 
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AE of the energy of an excited state is called the natural width 
I of the energy level of the nucleus*. 

42.8.2 Resonance absorption of gamma rays by nuclei implies 
the absorption by the nuclei of gamma photons of such frequency 
v that the energy hv of a photon is equal to the difference in the 
energies of one of the excited states and the ground state of the 
nucleus. The name means that the line of the gamma photon, 
emitted in the transition of the nucleus from an excited state 
to its ground state, has the same frequency v. 

When gamma photons are emitted or absorbed by a nucleus, the 
recoil of the nucleus is taken into account. In a transition of a 
nucleus from an excited state with the energy E to the ground 
state (of an energy taken equal to zero), the gamma photon 
acquires the energy Ep» equal to 


Wen = Eph = E — EN < E, 


where Ey is the nuclear recoil energy. 

In exciting the nucleus and its transition from the ground state 
to the state with the energy E, the gamma photon should have 
the energy Ew equal to Avaya = Ely = E+ En > E. 

The frequencies at the maxima of the emission lines (Vem) and 
absorption lines (vgps) are shifted with respect to one another 
by the amount 


Vabs — Vem = Av, which is such that hAv = 2En. 


The energy /y of nuclear recoil can be determined from the 
momentum ppn of the photon (37.2.2), which, in the processes 


of emitting and absorbing a gamma photon, should equal the 
momentum of the nucleus (Pph = PN): 


n= = tHe = (4 2MyN ’ 


where My is the mass of the nucleus. For the nucleus !8Ir, 
with the energy E = 129 keV of its excited state, calculations 
yield Ey = 0.05 eV and a shift Av = 2Fn/h of the maxima of 


* This definition also holds for transitions of atoms from an 
excited to the ground state. But here the lack of monochromatici- 
ty is called the natural spectral linewidth, and the indeterminacy 
is the natural width of energy levels of electrons in the atom. 
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emission and absorption lines. At this, hAv = 0.1 eV, which 
considerably exceeds the natural width of level F (42.8.1). 
42.8.3 A drastic reduction in the energy of nuclear recoil in 
absorbing and emitting gamma rays is evident when these pro- 
cesses are observed in nuclei in the crystal lattice, i.e. in the 
bound state. Under these conditions, the momentum and energy 
are imparted to the crystal lattice as a whole, rather than to the 
single nucleus that emits (or absorbs) a gamma photon. The 
mass of the crystal is substantially greater than that of a nu- 
cleus, and the loss of energy Eyn in emission or absorption of gam- 
ma rays becomes extremely small. In this case, resonance ab- 
sorption and emission of a gamma photon of a strictly definite 
frequency v is observed and the linewidth is comparable to the 
natural width (42.8.1). The phenomenon of resonance absorp- 
tion (or emission) of gamma rays without recoil is called the 
Mossbauer effect. 

42.8.4 The Mossbauer effect is made use of in nuclear spectro- 
scopy to make precise measurements of the energy levels of atomic 
nuclei. In the gamma transition in °’Fe nuclei, for instance, 
with the energy of the transition Æ = 14.4 keV, the change in 
the energy of the level is determined with an accuracy to T/E = 
= 3 XxX 10-18. For the gamma transition in °’Zn with the energy 
of the transition E= 93 keV, the quantity T/E was found to 
equal 5 x 10-6. 

The Mossbauer effect has been used to check the conclusion on 
shift of spectral line frequencies in a gravitational field (6.2.1). 
As a photon travels in a gravitational field its energy is changed 
by the amount AF = -—m (ə — Pı), where m = hv/c® is the 
mass of the photon (37.2.1), and p; and @, are the potentials 
of the gravitational field at points 4 and 2 (6.2.4). The minus 
sign indicates that the increase in the energy of a photon ina 
gravitational field is at the expense of its own energy E == hy: 


hAv = —mAg. 


The relative change in frequency of a photon as it passes through 
a gravitational potential difference A@ is 
Av Ag 


y c2? 


? 


Here Ag > 0 because the potential of zero gravity of the sun 
increases with the distance from the sun. At the earth’s surface 
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it is greater than at that of the sun. Consequently, Av/v < 0 
and all frequencies of spectral lines of the sun and stars, record- 
ed on the earth, are shifted toward the red end of the spectrum. 
This effect is called the gravitational red shift. 

The Mossbauer effect enabled the gravitational frequency shift 
of the gamma photon to be observed in its motion in the earth’s 
gravitational field. In vertical motion from the floor to the 
ceiling of the laboratory to a height of about 10 m, the relative 
change in frequency is 


Av 
V 


AQ oN E ere 
Soe ee aan cee e 


where g is the acceleration due to gravity (7.3.3). To record this 
frequency shift it is necessary to provide for resonance absorp- 
tion of gamma photons in such manner that the source and de- 
tector of the gamma rays have a relative linewidth equal to 
or less than 10-15. Then there will be no absorption if the fre- 
quency of the gamma photon incident on the nucleus differs from 
the frequency of the photon that the nucleus can absorb by the 
amount Av = 10" v. 

This experiment was conducted with two identical crystal 
sources of gamma rays, one of which could be placed at a height 
of 20 m above the other. When the detector of gamma photons 
was at the same height as the source of gamma photons, resonance 
absorption occurred. As the detector was raised 20 m the 
absorption ceased owing to the gravitational shift in frequency. 
The Doppler effect (30.6.1) was employed to restore absorption. 
At a definite velocity at which the detector and source approach 
cach other, the Doppler increase in frequency exactly com- 
pensated for its gravitational reduction, and resonance absorp- 
tion was restored. This experiment was a confirmation under 
laboratory conditions of the gravitational red shift. 


42.9 Nuclear Reactions 


42.9.1 Nuclear reactions are transformations of atomic nuclei 
brought about by their interactions with one another or with 
elementary particles (43.1.1). As a rule, nuclear reactions in- 
volve two nuclei and two particles. One pair, a nucleus and a 
particle, is said to be the initial, and the other, the final pair. 
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Nuclear reactions are represented in one of the following ways: 
A+t+a—+>B-+b or A (a, b)B, 


where A and B are the initial and final nuclei, and a and b are 
the initial and final particles in the reaction. In many cases 
a nuclear reaction can proceed in more than one way, the pre- 
ceding reaction being A + a — C + c, i.e. A (a, c) C, or some 
other version. The possible ways in which a nuclear reaction 
may proceed are called its channels. The initial phase of a reac- 
tion is called its entrance channel. 

42.9.2 A nuclear reaction is characterized by its energy Q (called 
the nuclear reaction energy), which is equal to the difference 
between the energies of the final and initial pairs in the reaction 
(42.9.1). When Q < 0, the reaction proceeds with the absorp- 
tion of energy and is said to be endothermic (or endoenergic); 
when Q > 0, the reaction proceeds with the release of energy 
and is said to be exothermic (or exoergic). An endothermic nu- 
clear reaction is found to be feasible at a certain minimum 
(threshold) kinetic energy Eth of the nuclei or particles causing 
the reaction: 


1 Ma+Ma 

Eth = Ma Q, 

where Ma is the mass of the stationary (target) nucleus, and 
M, is the mass of the particle (or nucleus) bombarding the 
target nucleus. 

Nuclear reactions comply with the laws of conservation of ener- 
gy, momentum, electrical charge and mass number. If the kinet- 
ic energy of the particles entering a reaction is sufficient for 
the creation of a nucleon-antinucleon pair (43.5.6), the mass 
number may change. Besides, nuclear physics has certain con- 
servation laws not found in other branches of physics (see Sects. 
43.4.1 through 43.4.4). 

42.9.3 The efficiency of a nuclear reaction is determined by 
the magnitude of the effective cross section o of the given reac- 
tion (reaction cross section). Quantity o has the dimension of 
area and characterizes the “yield” of the reaction per particle 
irradiating the nucleus. Thus 


= dno 
ie no No dx ` 
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It is assumed in this definition that a plane-parallel stream, 
containing no particles is incident in unit time on unit area of 
the cross section of a substance containing N, nuclei in unit 
volume; dno is the number of the particles that undergo the 
nuclear reaction in a layer of the thickness dz. 

42.9.4 Depending upon the nature of the interaction of particle 
a with the target A, distinction is made between direct interac- 
tion, in which nuclear reactions proceed in a single stage accord- 
ing to some process (42.9.1) and nuclear reactions that pro- 
ceed in two stages with the formation of a compound nucleus. 
In the first stage the incident particle is captured by the target 
nucleus and the particle’s energy is not imparted to any partic- 
ular nucleon, but is shared equally by all the particles of the 
compound nucleus. Thus none of the particles gains sufficient 
energy to be ejected from the nucleus*. The compound nucleus 
is dealt with as an excited statistical system of particles, having 
random motions similar to those of particles in a drop of liquid. 
Such rapid redistribution of energy between the particles of 
the nucleus is possible only upon frequent collisions of the par- 
ticles, this being typical for the redistribution of energy between 
the particles in a drop of liquid. As the result of chance deviations 
from uniform distributions of the excitation energy amongst 
the particles of the compound nucleus, sufficient energy is con- 
centrated on one of them to be ejected from the nucleus. This 
second stage of the nuclear reaction occurs after a lapse of time 
(107 to 108) ty following the first stage, where ty is the charac- 
teristic nuclear time (43.2.6). 

The sequence of a nuclear reaction proceeding with the formation 
of a compound nucleus is as follows: 


ALY |. Ag Ag L 


where ZX is the initial target nucleus, a is the incident particle, 


aay is the compound nucleus, ZC is the product nucleus and b 


is the particle ejected from the nucleus as a result of the reaction. 
Ifa = b, particles are being scattered by the nucleus (scattering 
is elastic or inelastic according to whether the energy of the 
particles is the same or different before and after scattering). 


* The “missile” captured by the nucleus, for instance, an alpha 
particle or deuteron, can consist of several particles. 
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If b = aa nuclear reaction is proceeding in the true sense of the 
term. 

42.9.5 Nuclear reactions. are classified according to various 
of their features: according to the energy of the particles that 
initiate them, according to the energy of the particles partici- 
pating in them, and according to the nature of the nuclear trans- 
formations that occur in them. Nuclear reactions at low ener- 
vies (of the order of 1 eV) proceed mainly with the participation 
of neutrons. Reactions at medium energies (up to several MeV) 
are induced, in addition to neutrons, by various charged par- 
ticles (alpha particles, protons, deuterons, nuclei of heavy hyd- 
rogen, etc.), as well as gamma photons. Charged particles that 
cause nuclear reactions may be multiply charged ions of heavy 
chemical elements, as well as charged particles accelerated in 
accelerators. Reactions at high energies (hundreds and thou- 
sands of MeV) lead to the creation of elementary particles (me- 
sons, hyperons, etc.) that are absent in the free state. 

42.9.6 The following are examples of nuclear reactions initiat- 
ed by alpha particles and deuterons ?D: 

(a) historically, the first nuclear reaction for the transformation 
of nitrogen into oxygen was the following: 


HIN + 4He — ('3F) > 40+ ip, 

or in shorter notation: 

N (a, p) +30; 

(b) the nuclear reaction in which neutrons were first obtained 
was: 

{Be -}- dhe — 12C -4-4n; 

(c) nuclear reactions initiated by deuterons 7D, the nuclei of 
heavy hydrogen (deuterium), can lead to the fusion of heavy 
nuclei, tritium 3H, or the light isotope of helium 3He, with the 
formation of a proton or neutron: 

2D -2D -> 3H -Htp and 2D-}2D -> 3He+-in 


(sce also 42.9.13). 

42.9.7 The action of neutrons jn leads to the formation of 
artificial radioactive isotopes, such as radiocarbon MC with 
a half-life (42.4.5) exceeding 5000 years: 


HN -in > YC Ip. (e) 


652 STRUCTURE AND PROPERTIES OF ATOMIC NUCLEI [42 


Subsequent decay proceeds according to the process 
AC yt: iy 
gC — tN + _%e-+ Sve, 


where _{e and $v, represent the electron and clectron antineu- 
trino. The long half-life of 14C is the basis for the radiocarbon 
(carbon-14) dating technique used in archacology. By deter- 
mining the relative amount of undecayed radioactive carbon, 
which ceased to be accumulated in a living organism when it 
died, it is possible, according to reaction (*), to establish the 
time when the organism no longer absorbed the isotope 14C from 
the atmosphere. This isotope is formed in the atmosphere from 
nitrogen by the action of cosmic-ray neutrons. 

42.9.8 Slow neutrons are ones whose de Broglie wavelength 
(38.1.4) exceeds the nuclear radius R: 4 >> f/mv and, accordingly, 
v<h/mR. Their energy does not exceed 100 keV. Neutrons 
with energies up to 0.5 eV are called thermal neutrons. As such 
neutrons pass through matter they undergo capture, the absorp- 
tion of the neutrons by the nuclei. If the energy of the neutrons 
coincides with that of the compound nucleus (42.9.4), resonance 
absorption (resonance capture) of the neutrons occurs. This 
process is the basis of the technique for obtaining transuranic 
chemical elements. The transuranic element neptunium *38Np 
is formed upon resonance capture of neutrons by the most abun- 
dant isotope of uranium 238U according to the reaction 


233] 14 39 —> 239 
§2U + gn > 793U a3 NP, 


23 min 


where the half-life of radioactive isotope *33U is indicated, 
which is transformed into ?33Np. Next, the nucleus of isotope 
239Np is transformed into plutonium 23?Pu: 


239 — > 239 

DNP 3 9 dave pa P 

Owing to its efficient fission by the action of thermal neutrons 
(42.9.10), plutonium plays a prominent role in the production 
of nuclear energy. Plutonium ?}?Pu is radioactive by alpha par- 
ticle emission and has a huge half-life (24 000 years). It is trans- 
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formed into the stable isotope of uranium 233U according to the 
reaction: 
(9A 


Hipi e m. 
n 2.4X104 years * 

42.9.9 Nuclear reactions using accelerated nuclei of chemical 
elements enabled advancement in Mendeleev’s table up to the 
element with the number Z == 107. The element with Z = 105 
was named nilsbohrium (Ns). Upon bombarding the isotope 
“47Pu with nuclei of neon 22Ne a chemical element with Z = 104 
(kurchatovium 269Ku) is obtained. After capture of the nucleus 
Ne by plutonium, the compound nucleus of kurchatovium 
1o4 Mu is formed. Then, in one out of ten thousand million com- 
pound nuclei, the nucleus of the element 289Ku was produced 
after emitting four neutrons. Thus 


242 2 G 1 
24 Pu + 22Neo — 264Ku — ?280Ku + 4in. 


In the most powerful beams of accelerated neon nuclei, one nu- 
cleus of kurchatovium is created in several hours. 

42.9.10 A heavy compound nucleus, excited in the resonance 
capture of a neutron, can divide into two approximately equal 
parts (fission reaction of heavy nuclei). The parts that are formed 
are called fission fragments. The instability of heavy nuclei 
against fission is due to the large number of protons they con- 
(ain which are subject to Coulomb repulsion from one an- 
other. ; 

The fissure of a heavy nucleus into two fragments is accompanied 
by the evolution of a huge amount of energy. This follows from 
a comparison of the specific binding energies of nuclei of ele- 
ments located inthe middle and at the end of Mendeleev’s period- 
ic system. The energy released in the fission reaction is pro- 
portional to the difference in the specific binding energies of 
an unstable “friable” nucleus and two “tightly packed” stable 
fission fragments. This difference constitutes 4.4 MeV per nu- 
cleon (42.2.4). In the fission of a nucleus of uranium 233U, con- 
taining 238 nucleons, the evolved energy should be of the order 
of 200 MeV. In the fission of nuclei contained in 1 gram of ura- 
nium #33U, the energy produced is 8 x 102° J or 22 000 kWh. 
Ileavy nuclei are capable of fission if they comply with the 
condition: Z?/A > 17, where Z?/A is called the fission parameter. 
This condition is complied with for all nuclei beginning with 
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silver 1{8Ag@ for which Z2/A x 20. Nuclei for which (Z2/A)er > 
>> 49 (the critical fission parameters) are entirely unstable with 
respect to fission and cannot exist. For kurchatovium (Z -= 104) 
(42.9.9). Z?/A ~ 41. Spontaneous nuclear fission is possible at 
values Z?/A < (Z?/A),,, which proceeds similar to radioactivity 
by alpha-particle emission (42.5.1), employing the tunnel effect 
(38.7.2). The half-life (42.4.5) for spontancous nuclear fission 
is from 1016 to 10!” years. 

42.9.11 At the instant they are formed fission fragments have 
a neutron excess, i.e. the number of neutrons are in excess of 
the number of protons. Excess neutrons, emitted by the frag- 
ments, are called fission neutrons. Their number may vary and 
the nuclear fission process is accompanied by neutron multi- 
plication, characterized by the average number (v) of emitted 
neutrons per fission event. For nuclei of plutonium 22?Pu and 
uranium 733U, in which fission is due to thermal neutrons, (v) 
equals 3.0 and 2.5, respectively. Fission neutrons include prompt 
(secondary) and delayed neutrons. Prompt neutrons are emitted 
coincident with nuclear fission during a time of the order of 
10-14 s. Delayed neutrons are emitted by the fission products at 
some time after fission occurs. 

42.9.12 If each of the prompt neutrons emitted in a fission 
reaction (42.9.11) interacts with neighbouring nuclei of the 
fissible substance, inducing fission in them, there will be an 
avalanche-like build-up of acts of nuclear division. This is 
called a chain fission reaction. A condition for the initiation of 
a nuclear chain reaction is the availability of neutron multipli- 
cation. 

The neutron multiplication factor k is the ratio of the number of 
neutrons emitted at a given stage of the reaction to the number 
of such neutrons in the preceding stage. The condition for a self- 
sustaining reaction is k > 1. The practical feasibility of nuclear 
fission chain reactions has been proved by the development of 
nuclear power engineering, a branch of engincering in which var- 
ious types of nuclear reactors have been designed and developed. 
Such reactors are devices in which controlled nuclear chain reac- 
tions are carried out. 

42.9.13 The reaction of the fusion of nuclei of tritium or helium 
from nuclei of deuterium, discussed in item (c) of Sect. 42.9.6, 
is a second method of producing intranuclear energy in addi- 
tion to the fission of heavy nuclei. The specific binding energies 
(42.2.4) in three nuclei—2D, 3H and 3He--are approximately 
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in the ratio 4 : 3 : 6. This means that the nuclear reactions dis- 
cussed in Sect. 42.9.6 are accompanied by the evolution olf 
great amounts of energy: 4.04 MeV in the first reaction and 3.27 
MeV in the second. Still more energy, 17.58 MeV, is released 
in the reaction 


2D+3H > 4He+ in. 


Per particle this energy is equal to 17.6/5 MeV = 3.5 MeV, 
i.c. approximately four times as much as in the fission of ura- 
nium 238U (42.9.10): 200/238 MeV = 0.85 MeV. Even more 
efficient with respect to the specific evolution of energy is the 
fusion of nuclei of helium 4He from four protons. This energy 
amounts to 6.70 MeV per particle. 

42.9.14 The fusion reaction of light nuclei, requiring the over- 
coming of the potential energy of their repulsion, can be effi- 
ciently carried out at super-high temperatures of the order of 
108 or 109 K, which exceeds the temperature in the central re- 
gions of the sun (T = 1.3 x 107 K). Such reactions are called 
thermonuclear reactions and they take place in matter in a plasma 
state (22.6.1). Thermonuclear reactions are evidently the sources 
of energy of the stars, compensating for their loss of energy 
through radiation. The sun radiates 3.8 X 107° J of energy per 
second. This corresponds to the evolution of energy per unit 
mass per second of only 1.88 x 10-4 J/s-kg. This is only one 
per cent of the specific energy production of a living organism 
in the process of metabolism. 

It is thought that thermonuclear reactions take place on the sun 
in the form of thermonuclear chains, or cycles, in which energy 
is released by the transformation of hydrogen nuclei into he- 
lium nuclei. 

One of the versions of the proton-proton chain, or deuterium 
cycle, begins with combination of two protons to form deute- 
rium with the emission of a positron and an electron neutrino: 


iP+4P > {D-+ +e- ove. 
The chain continues according to the reaction 
iD-}-{p — 3He+y 


where gamma radiation is emitted, not only as excess energy, 
but in the combination of the positron with electrons, which 
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always exist in plasma (43.5.4). A probable continuation of the 
chain is a reaction with the evolution of energy: 


’He + 3He > 4He+ 2ip, 


where $He is the symbol for the alpha particle. 

In the carbon-nitrogen cycle, the carbon nuclei are “catalysts” 
in the reaction for the fusion of hydrogen nuclei into a helium 
nuclei. 

At the beginning of the cycle a fast proton penetrates a carbon 
nucleus: 


C- p> FN -y 


The radioactive isotope of nitrogen 13N, with a half-life of 
14 min, decays (42.6.1): jp — ¿n + +e + $v, and a nucleus 
of a carbon isotope is formed: 


13 13 0 04; 
SN — 13C + te H Ove. 


Approximately after cach 2.7 million years, a nucleus of !3C 
captures a proton, forming the nucleus of the stable nitrogen 
isotope 14N: 


C-A- 4P > tN +y. 


On an average of 32 million years later, the nucleus of 14N cap- 
tures a proton and is transformed into a nucleus of oxygen 150: 


IN ip > t30- Y. 


The unstable nucleus of 150, with a half-life of 3 min, emits a 
positron and a neutrino as it is transformed into the nucleus 
of HN: 


130 —> 15N — fe -H Bve. 


The cycle is completed by a reaction that takes place approxi- 
mately after 100 thousand years: 


EN 4- ip > C+ $H. 


The result of the cycle is the transformation of four protons into 
a helium nucleus with the emission of two positrons and gamma 
radiation. The relcased energy amounts to 26.8 MeV per helium 
nucleus. This corresponds to an energy output of 700 thousand 
kWh per gram-atom of helium. This energy is sufficient to com- 
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pensate for the energy the sun loses in radiation. Various stages 
of the cycle are separated by tremendous lengths of time when 
judged according to the time scale on the earth. But this cycle 
is closed and continuous. Hence, all the stages of the cycle 
proceed simultaneously on the sun, having begun at various 
instants of time. 

42.9.15 Conditions, close to those prevailing in the interior of 
the sun, were provided in a thermonuclear, or fusion, bomb 
(more commonly called the hydrogen, or H, bomb). In this bomb 
a self-sustaining thermonuclear fusion reaction of an explosive 
sa is initiated in a mixture of deuterium and tritium. 
Thus 


iD-+ {7H > ¿He + gn. (*) 


The high temperature required for a thermonuclear reaction 
was obtained by the explosion of an “ordinary” atomic bomb, 
operating on the principle of a rapid chain reaction in the fis- 
sion of heavy nuclei. 
The theoretical basis for controlled thermonuclear fusion is a 
reaction of the type 


2D-+3H > $He-+ dn, * 
as well as of the type 
2D-- 2?) — 3H -ip or 2D-H2D -> 3He-|- in. (++) 


The feasibility of these reactions reduces to the necessity to 
meet two requirements: the availability of a certain least tem- 
perature and a definite restriction of the product nt: 


nt > 1016 and T > 10° K for reaction (++), 
nt > 1044 and T > 108 K for reaction (+). 


Here n is the concentration of matter in the plasma, and tT 
is the time the particles are confined in the plasma. The pre- 
ceding inequalities indicate that the number of nuclei reacting 
in thermonuclear fusion should be proportional to the concen- 
tration of matter and the time of existence of the nucleus. 
The practical realization of controlled thermonuclear fusion 
reactions is a timely problem of enormous importance and its 
solution should be reached in the near future. 


42-0186 
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CHAPTER 43 ELEMENTARY PARTICLES 


43.1 Preliminary Information on Elementary Particles 


43.1.1 Elementary, or fundamental, particles are ones to which 
an internal structure that is a simple combination of other stable 
particles cannot be attributed at the present state of knowledge. 


TABLE 43.1 
Particles and Charge,| Mass, Tay 
antiparticles Symbol e MeV Lifetime, s 
Photon 0 | 0 own | | o | o [sume | 


electron neutrino 
and antineutrino 


mu-meson neutrino 0 OF 0 Stable 


y y 
u Vu 
and antineutrino 


Electrons: 


electron and positron —1 +4|0.5411] Stable 


Muons: 
u` meson and pt me- 
son 


—1 +41 106 | 2.2x40-6 


Mesons: 

Pions: 

m+ meson and nm mce- 
son 


n’? meson 0 135 0.8 10-16 


Kaons 
K+ meson and K- me- 
son 


K+ K- 


Leptons 
Neutrinos: 
Eee pa em +41 —1| 494 1.2 XxX 10-78 


Ve Ve 0 QO Stable 


+4 —4| 140 | 2.6x10-8 keaz 
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In its interaction with other particles and fields, an elementary 
particle behaves as an integral unit. The term “elementary” 
particle is essentially of a “conditional nature. 

43.1.2 In energy regions less than 2mọc? (where mọ is the rest 
mass of the particle), the structure of particles has no effect on 
interaction between them and other particles and fields. In 
these cases elementary particles are dealt with as particles in 


aR T Baryon |Strange-| Iso- Projection Dominant decay 
charge ness spin of isospin mode 

1/2 | 90 0 

1/2 | 0 0 

1/2 | 0 0) 

1/2 10 O U7 —> e7 -vu -Hye 


0 0 00 0 1 +1 —1 mt — ptt vy 
0 0 0) 4 0 19 —> 2y 


0 O O |+4 —4| 1/2 | +1/2 —1/2 | Kt pt-+ vy 


42% 
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Particles and 


antiparticles Symbol 
K® meson, anti-K® | K° Ke 
meson 
1° meson n° 
Baryons 
Nucleons: 
proton and antipro- pP 
ton a 
neutron and anti- nn 
neutron 
Hyperons: 7 
A® hyperon and anti- | A® A® 
A® hyperon R 
2t hyperon and anti- | 2+ X- 
2+ hyperon K 
=~ hyperon and anti- | =- xt 
2- hyperon 
20 hyperon and anti- | 2° yo 
2° hyperon 


2- hyperon and anti- | 2- 
H- hyperon 
a0 hyperon and anti- 
a0 hyperon 
Q- hyperon and anti- 
Q- hyperon 


Q- Q+ 


—1 -4| 1672 
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Charge, ps Lifetime, s 

K9 : 0.86 X10710 

O 0j 498 |K? :5.38x10-8 

4+4 —1/938.2| Stable 

0 01939.6} 0.93103 
0 Oo] 4146] 2.5x40-10 
+4 —4| 1189 | 0.810710 
—1 44) 1197} 1.5 40-20 
0 Oo} 1492 < 1071 
—1 4| 4324 | 1.7x10710 
0 o| 1315 3x10-10 
1.3 10-10 


qE 
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Dominant decay 
mode 


Iso- 


Baryon 
spin 


charge 


Strange- 
ness 


Projection 


Spin, 
h of isospin 


K > 2x 
TUL 

0 0 O41 —1) 1/2 | —1/2 41/2 | K? 3x 

ev 

Ze 

0 0 0 0 0 n° —>3n° 
N'A n? 


1/2 +4 —4 O0 Of 1/2 | +4+1/2 —1/2 


1/2 |4+4 —14 0° 0| 1/2 | —1/2 +4/2 | n— p+ e-+ ve 


12 l1 1—1 +4] 0 0 O [A0> p+r 
1/2 41 1—1 +4) 1 |414 —1 [E >N+n 
4/72 |44 1 +4) 14 a 44 en 
1/2 |44 1—1 +4) 1 0 0 [E> Ay 
1/2 |+4 —4}-2 +2} 1/2 | —1/2 +1/2 |E- > A0 +a 


4/2 |44 —1J-2 +2] 1/2 | 44/2 —1/2 | 20 -> A04 n0 


3/2? |44 —41-3 +31 0 0 0 |Q--+8-+n0 
AO-+ K- 


662 ELEMENTARY PARTICLES [43 


the sense of material points, but possessing many properties, 
such as: rest mass, electric charge, spin and a dominant decay 
mode (Table 43.1). The concept of a point elementary particle 
meets the requirements of relativity theory. An extended par- 
ticle of finite dimensions, which is an integral whole, should 
not be deformable as otherwise independent movements of 
its parts would be possible. But this would mean that external 
action on such an elementary particle would be instantaneously 
transmitted from certain parts to others, and this contradicts 
relativity theory (5.1.1). 

43.1.3. A structureless charged elementary particle has an in- 
finite potential (16.2.6) and infinite potential energy. Up-to-date 
particle physicists are working out various methods of intro- 
ducing nonpoint-like interaction. These methods eliminate the 
“infinity” of point elementary particles, and introduce definite 
structures for them without contradicting relativity theory. 
It follows from the indeterminacy relations (38.6.2) that a 
study of the structure of elementary particles requires the inves- 
tigation of processes taking place at the extremely short distances 
ô that the momentum indeterminacy Ap > ii/6 corresponds 
to. The shorter the distances 6, associated with the structure of 
elementary particles, the greater the momentum p of the par- 
ticles must be, because it cannot be less than Ap. Consequently, 
the structure of elementary particles can be cleared up only by 
research involving high energies. For this reason, particle phys- 
ics, the new branch of physics engaged in the study of elementa- 
ry particles, is now more commonly called high-energy physics. 


43.2 Classification of Elementary Particles and Their 
interaction 


43.2.1 There are several groups of elementary particles which 
differ in their properties and the nature of the interaction be- 
tween one another. 

The main classification of elementary particles is based on 
their various rest masses: distinction is made between leptons 
(light particles), mesons (intermediate particles) and baryons 
(heavy particles), with the last being further subdivided into 
nucleons and hyperons. 

43.2.2 As to their clectric charge, measured in units of the 
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elementary charge e (14.1.2), particles may be positive, nega- 
tive or neutral. The existence of particles with fractional elec- 
tric charges, called quarks, has been predicted. 

With respect to their spin (26.1.3), elementary particles are 
subdivided into ones with integer, half-integer and zero spin 
(in units of %). Depending upon their spin, assemblies of par- 
ticles comply with different quantum statistics (41.2.4 and 
41.2.4). 

43.2.3 Elementary particles are either stable or unstable. This 
classification characterizes their lifetime and the dominant 
decay mode. The characteristics of elementary particles are listed 
in Table 43.1. The rest masses of the particles are given in 
energy units, as is now customary practice, on the basis of the 
relation Æ = me? (5.7.2). Since the rest mass of the electron 
is approximately 0.5 MeV, the mass of a particle in electron 
mass units can be found by multiplying its mass in MeV by 
two. 

43.2.4 In addition to the elementary particles listed in Ta- 
ble 43.1, there are also a great many particles called resonances 
(resonons and resonance particles). Their lifetimes are character- 
istic of strong interaction (43.2.5); they have definite proper- 
ties, energies and momenta, and definite decay modes. These 
features enable resonances to be regarded as special particles. 

43.2.5 Mainly three of the four known kinds of interaction exist 
between elementary particles: strong, electromagnetic and weak. 
Gravitational interaction between elementary particles is usu- 
ally not considered. Each of the interactions is characterized by 
its interaction coupling constant, determining its comparative 
magnitude, and its characteristic interaction time (time con- 
stant of interaction). 

43.2.6 Strong interactions refer to processes involving baryons, 
as well as pions and kaons (Table 43.1). Strong interactions 
result from nuclear forces between nucleons (42.3.1), as well 
as the processes of creation and decay of mesons and hyperons 
in nuclear reactions at high energies. Strong interaction is char- 
acterized by the Fermi constant B = f?/ħe ~ 1, where the con- 
stant f has the dimension of an electric charge. Processes that 
display strong interaction are said to be fast and have a charac- 
teristic nuclear time ty. In order of magnitude this time equals 
the Compton wavelength of the nucleon h/mpe (where mp is the 


mass of the nucleon), divided by the velocity of light in free 
space, i.e. hl tye”. | 
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Particles that take part in strong interactions are called hadrons 
(large and heavy particles). 

43.2.7 Electromagnetic interactions characterize processes that 
take place between particles that have electric charges. For 
instance, Coulomb repulsion in nuclei, and the processes of 
creation and annihilation of electron-positron pairs are asso- 
ciated with electromagnetic interaction. A measure of this 
interaction is a dimensionless constant called the fine-structure 
constant: a = e?/ħe = 1/137. It has been thus named because a 
determines the magnitude of fine splitting of the atom’s energy 
level into a number of sublevels. This is caused by the effect 
of the relativistic dependence of mass on velocity (5.6.1) on 
the electron’s energy in the atom. Processes accompanying 
electromagnetic interaction are called electromagnetic processes 
and they proceed during a time of the order of 10-7 to 10-18 s 
in accordance with the fact that a/B ~ 10-2 (43.2.6). Elec- 
tromagnetic interaction is the only one of the interactions of the 
microcosm that is also manifested in macroscopic phenomena 
and processes. 


TABLE 43.2 


Interaction | yeas Interaction time, s 
Strong 4 10-23 to 10-22 
Electromagnetic 1/137 10-29 to 10-18 
Weak 10-14 10-29 to 1078 


43.2.8 Weak interactions are processes involving leptons (Ta- 
ble 43.1). Examples of such processes are beta decay of nuclei 
(42.6.1), and interaction between nuclei and muons, clectrons 
(or positrons), and neutrinos (or antineutrinos). 

Weak interaction is characterized by the weak interaction con- 
stant G, having the dimension of the inverse square of mass. 
The dimensionless weak interaction constant g? = Gm2 x 10-5, 


where mp is the mass of the proton. Weak interaction is the 
slowest of the interactions manifested in the microcosm and 


43.3] VARIOUS ELEMENTARY PARTICLES 665 


proceeds during time of the order of 10-1 s. A comparison of 
the various kinds of interaction between elementary particles 
is given in Table 43.2. 


43.3 Certain Information on Various 
Elementary Particles 


43.3.1 Various kinds of elementary particles (Table 43.1) are 
produced in the interaction of cosmic rays with the nuclei of 
the nitrogen and oxygen atoms in the atmosphere. Since the 
development of modern accelerators, the creation, mutual trans- 
formations and structure of elementary particles are investigated 
in the laboratory. 

43.3.2 The feasibility of creating new particles in the interac- 
tion between protons and nuclei follows from the relation between 
mass and energy (5.7.2). When an energy of 104 GeV is imparted 
to the proton, approximately 10+ times more than its rest energy 
mpc? (where my is the rest mass of the proton and cis the velocity 


of light in free space), a collision between the proton and a 
nucleus leads to splitting of the latter. High kinetic energy is 
imparted to the fission products and new particles are created, 
some having a rest mass and others without one. 

43.3.3 Muons (mu mesons) are the electrically charged parti- 
cles u+ and u- with a rest mass equal approximately to 200 mg, 
where meis the rest mass of the electron. The main characteris- 
tics of muons are listed in Table 43.1. Muons are not stable, 
and decay according to the modes: 


ut -—> +e + bVe+ 8u and u~ > [e+ ove +- Vu, 


where _°e and łe are the electron and positron, ĉvęe and $v, 


and §v 


are the electron neutrino and antineutrino, and $v i 


are the muon neutrino and antineutrino. 
The muon neutrino (or antineutrino) is a particle having nei- 
ther rest mass nor electric charge, and which is emitted in the 
creation and decay of mesons. These particles differ from the 
neutrino (or antineutrino) emitted together with electrons in 
beta-decay processes (42.6.1). Therefore, the two types of neu- 


u 
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trino (and antineutrino) are designated by different symbols: 


Ave (Ve) and vu (Vu). The spin of the mu mesons equals + 4/2 
and the muon decay reaction complies with the conservation 
laws (42.9.2). The interaction of muons with nuclei is an exam- 
ple of weak interaction (43.2.8), and muons are nuclearly inac- 
tive particles. 

43.3.4 Pions (pi mesons) are neutral (n?) and electrically charged 
(n+ and x`) particles with a rest mass of approximately 300 mg, 
where m, is the rest mass of the electron. The main character- 
istics of pions are listed in Table 43.4. Pions, like muons, are 
unstable and decay according to the modes: 


nt—>utteovi, 1 —> u -H8vu and m+ yty. 


In these decay reactions u+ and u- are positive and negative 


muons, $v, and jv, are the muon neutrino and antineutrino, 
and y is a gamma-ray quantum (photon). Shown schematically 


mh 


Fig. 43.4 Fig. 43.2 


in Fig. 43.1 is the sequence for the (n-u-e) decays. Pions are spin- 
less particles. This follows from their decay reactions and the 
ee that these reactions comply with the conservation 
aws. 

43.3.5 Pions are formed in cosmic rays as the result of the dis- 
integration of the nuclei of atmospheric gas atoms by fast 
protons and alpha particles. To produce pions the energy of 
the protons should be about 300 MeV. 

The arrangement for obtaining pions in the laboratory is illus- 
trated in Fig. 43.2. Target A of beryllium or carbon is bom- 
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barded by fast protons ip. This knocks pions out of the target 
at arbitrary angles. The magnetic field of the accelerator causes 
the ejected pions to follow circular paths whose radii are deter- 
mined by the velocities of the pions. Pions ejected in the for- 
ward direction are divided into two beams, the negative ones 
being deflected outside the accelerator chamber and the positive 
ones inside. The opposite is true for pions ejected in the back- 
ward direction (not shown in Fig. 43.2). 

The masses of charged ions are determined by their energies 
and momenta by methods employing the deflection of particles 
in magnetic and electric fields. The lifetime of pions (Table 43.1) 
is established by measuring the time intervals between the 
instant of creation of a pion and its instant of decay (43.3.4). 
Pions are nuclearly active particles. Their interaction with 
nuclei is an example of strong interaction (43.2.6). 

Similar methods are applied for investigating other charged 
particles. Since neutral particles are not deflected in a magnetic 
field, they are investigated by means of the conservation laws 
and interaction reactions of the given particle with other parti- 
cles. For example, the reaction of a negative pion with a proton 
proceeds according to the mode 


TT- ip -> m+ in, ° 

The negative pion is transformed into a neutral one, and the 
proton into a neutron.The neutral pion n° decays into two gamma- 
ray quanta (43.3.4). The mass of the neutral pion (Table 43.1) 
is determined from the known masses and energies of the proton, 
neutron and negative pion, from the energy of gamma-ray quan- 
ta, as well as from data on the momenta of particles in these 
transformations. 

43.3.6 The principle of applying the law of conservation of 
momentum for studying the transformations of elementary par- 
ticles can be illustrated by the example of the B- decay in which 


an electron antineutrino 9 Ovo is created (42.6.1). If only a single 
electron were ejected by the nucleus in a B- decay, the nucleus 
would be subject to recoil in the direction directly opposite to 
that of the ejected electron. The momentum of the nucleus 
that was stationary before the decay reaction would be equal 
in magnitude but opposite in direction to that of the ejected 
electron. If, however, in addition to the electron, the nucleus 
also ejects an antineutrino, the vector sum of the three momenta, 
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that of the electron, antineutrino and recoil nucleus, should, 

according to the law of conservation of momentum (2.7.1), 
be equal to zero, as before the decay reaction (Fig. 43.3). 

43.3.7 Kaons, or K mesons, are neutral (K?) and charged (Kt 

and K-) particles with rest masses close to 1000 me, where me 

is the rest mass of the electron. As 

other mesons, kaons are unstable, but, 

in contrast to the decay reactions of 

pions and muons (43.3.3 and 43.3.4), 

the reaction channels of their decays 

are not unambiguous. As a result of 

kaon decay, pions, muons and neu- 

Antineulrind trinos(or antineutrino) of both types 

f are formed. Like pions, kaons are 

spinless. All of these particles (pions 


Loe) 
Q 
2. 
= 
> 
A 
5S 
A 
U 


and kaons) are united in a single group, 
< the mesons (Table 43.1). 
S 43.3.8 A large group of particles, 
& called hyperons, have rest masses 
S greater than that of nucleons, and 
range approximately from 2183 m 
Fig. 43.3 for the lambda-zero hyperon (A?) to 


3280 me for the omega-minus hyperon 

Q-). Hyperons are unstable (see 
Table 43.1). The lighter hyperons decay into nucleons and pions. 
The heavier hyperons decay into lighter hyperons, pions and 
kaons. All hyperons have a spin equal to f/2, except the Q- 
hyperon, whose spin is equal to 34/2. Nucleons (protons and neu- 
trons) and hyperons are related particles and they belong to 
a single class of heavy particles called baryons (Table 43.1). 


43.4 Certain Conservation Laws in 
Elementary-Particle Physics 


43.4.1 The differences in the rest masses of charged m* and the 
neutral n° pions, like the difference in the rest masses of the 
two states of nucleons—proton and neutron—in the nucleus, 
are the result of charge independence of nuclear forces (42.3.3) 
and of the presence of electromagnetic interactions in addition 
to strong nuclear interactions. With no electromagnetic interac- 
tions in the nucleus, charge independence of nuclear forces 
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would lead to a condition in which the proton and neutron have 
equal rest mass, and the masses of all types of pions coincide. 
When the additional electromagnetic interaction is taken into 
account, the double state of the nucleon (proton-neutron doublet) 
is split into two different states in accordance with their rest 
masses. The masses of the neutron and proton are found to dif- 
fer. For the same reason, the masses of the particles that make up 
the pion triplet are also not equal to one another. The masses 
of the charged m+ pions exceed that of the n? pion (Table 43.1). 
43.4.2 Introduced to characterize the multiplicity of states of 
certain kinds of particles are the concepts of isotopic spin T 
(isospin) and the projection Tg of isotopic spin on a certain “axis”. 
The introduction of these characteristics of elementary particles 
makes it possible to formulate special features of the properties 
and transformation of elementary particles that follow from 
charge independence in strong interactions. Isotopic spin T 
characterizes the number of particles in the given “charge mul- 
tiplet”; this number is equal to (2T + 1). Thus the isotopic 
spin of the nucleon equals T = 1/2, and that of the pion is 
T = 1, in accordance with the number of particles (nucleons 
and pions) in the doublet and triplet. 

The projection Tę of isotopic spin of the nucleon is introduced 
in such a way that it is related to the charge Z (in e units) of 
a particle that is a member of a charge doublet as follows: 


Z = T + 1/2. 


For the neutron Tg = —1/2, for the proton Tg = 1/2. 

For the triplet of pions, this relation is of a more complex na- 
ture (43.4.3) and leads to the situation in which the projection 7; 
of isotopic spin is equal to the charge of the particle (in e units): 
Tg = 0 for the n° meson, 7; = 1 for the n+ meson and 7; = 
= —1 for the m- meson. 

The law of conservation of isotopic spin states that the total iso- 
topic spin T of all the particles of an isolated system remains 
constant through all transformations caused by charge-inde- 
pendent strong interactions. 

43.4.3 The concept of a baryon or nuclear (nucleon) charge B 
can be introduced for all elementary particles. For baryons 
(Table 43.1) B = +1, for antibaryons B = —1, whereas for 
particles that do not belong to the baryon class B = 0. 

The law of conservation of baryon charge states that in all nuclear 
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transformations in an isolated system the total baryon charge 
remains constant. 

This law, like the law of conservation of electric charge, is 
valid both in strong and in electromagnetic interactions. Making 
use of the baryon charge B, the relation between the electric 
charge Z and the projection of isotopic spin Tę of a particle in 
the charge multiplets of nucleons and pions can be written as 


Z = Ty + B/2. 


For nucleons (proton and neutron) B = 1 and Z = T + 1/2 
(43.4.2). For pions B = 0 and Z = Tx. 

Since Z and B are conserved both in strong and electromagnetic 
interactions, Tę is also conserved in these interactions, whereas 
T is conserved only in strong interactions. 

43.4.4 The investigations of kaons and hyperons (Table 43.1) 
revealed certain of their unusual properties. The creation of 
these particles is brought about by strong interaction, and the 
time for their creation corresponds to nuclear time (43.2.6). 
At the same time, when they decay into nuclearly active pions, 
kaons have a lifetime typical of weak interaction (Table 43.2). 
Moreover, it was established that kaons and hyperons are al- 
ways created in pairs, but not in arbitrary combinations. Fi- 
nally, a substantial difference was found in the conditions for 
the creation of kaons and the reactions of their interaction with 
other particles. These features made it necessary to consider 
kaons and hyperons to be strange particles. A special character- 
istic of elementary particles was introduced to explain the 
behaviour of these strange particles. It is called strangeness S 
and is nonzero only for strange particles (S = +1, +2, ...) 
and zero for “ordinary” particles. For strange particles the rela- 
tion in Sect. 43.4.3 is of the form 


B+S 
Z=; +t. 


From the conservation law formulated in Sect. 43.4.3 follows 
the law of conservation of strangeness which states that the total 
strangeness of an isolated system remains constant in strong 
and electromagnetic interactions.* 


* The information given in this section does not cover all the 
conservation laws in high-energy physics. But information on the 
other laws is beyond the scope of this handbook. 
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43.5 Antiparticles 


43.5.1 The majority of elementary particles have corresponding 
anliparticles. Examples of particles and antiparticles are: the 
electron _%e and positron ,e, muons u+tand u~, pions m+ and n7, 
kaons K+ and K~, the electron and muon neutrinos, $v, and 


ovu, and antineutrinos, Ve and Yy. The rest masses, spins 
and lifetimes of particles and their antiparticles are the same. 
The electric charges of particles and their antiparticles are 
the same in magnitude, but opposite in sign. 

Particles whose properties are completely identical with those 
of their antiparticles are called truly neutral particles. They 
include the photon and the neutral mesons, n? and K°. 

43.5.2 The existence in nature of pairs of charged particles 
and their antiparticles expresses the principle of charge conju- 
gation, which states that each charged clementary particle 
must have an antiparticle. In accordance with. this principle, 


the proton has an antiparticle, the antiproton iP (Table 43.1). 
The principle of charge conjugation extends to the neutral 
particles, neutron and neutrino, as well. There is the antineu- 


tron jn, as well as the electron antineutrino 8v, and muon anti- 


neutrino v.. 

43.5.3 When a particle combines with its antiparticle, the 
energy released is not less than twice the rest energy of each of 
them. The formation of a “particle-antiparticle” pair requires 
an energy input that exceeds twice the rest energy of each par- 
ticle. This is due to the necessity to impart momentum and ki- 
netic energy to the pair being produced. 

43.5.4 The process of producing an electron-positron pair occurs 
in the collision of a hard gamma-ray quantum with some kind 
of charged particle, for instance, with an electron on the shell 
of an atom or with an atomic nucleus*. This process proceeds 
according to the reaction y — _°e + ĉe and is feasible with 


* The need for a third particle is due to the fact that in producing 
two particles with the rest mass mọ = 0 from a photon, the 
total momentum of the two particles is less than the momentum 
hv/c of the photon. Hence, a third particle is required to take 
up part of the momentum of the photon. 
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a photon energy not less than 2mec?, i.e. hv > 2mec? = 1.022 
MeV, where mec? == 0.511 MeV is the rest energy of the electron 
(or positron). It follows from the reaction for producing a pair 
by the action of a gamma-ray quantum (photon) that the spin 
of the photon should equal 0 or 1 (in units of fh), because the 
spins of electron and positron are each equal to A/2. A great 
many experimentally established facts and great deal of theo- 
‘Tetical investigation lead to the conclusion that the spin of the 
photon is equal to h. 

43.5.5 The process of uniting an electron and a positron is cal- 
led positron-electron annihilation. In the great majority of cases, 
the process results in the production of two gamma-ray quanta: 


-1e -+ 4$e — 2y. 


Before combining the particles _?e and ĉe, the total momentum 
of the two particles, in the coordinate system fixed in the centre 
of mass (2.3.3) of the electron-positron system, was equal to 
zero. After annihilation of the pair, two photons are produced 
whose momenta are in opposite directions, and their total mo- 
mentum should be equal to zero*. Each photon carries away 
energy equal to hv = m,c? = 0.511 MeV. 

43.5.6 The minimum energy required to produce a proton-anti- 
proton pair in a coordinate system where one nucleon is at 
rest is 6mpc? = 5.6 GeV. This energy is reduced to 4.3 GeV 


when the ip-_ip pair is actually produced. Considerably more 
energy is required to produce a hyperon-antihyperon pair than 
for a pair of nucleons. This is due to the comparatively large 
mass of the hyperons. For example, the production of the lightest 


antihyperon, the antilambda-zero hyperon (A®°), requires an 
energy from 5.3 to 5.8 GeV. 

43.5.7 A feature of antiparticles is their capacity to combine 
rapidly with their corresponding particles, for instance, po- 
sitrons with electrons, antiprotons with protons, and antineu- 
trons with neutrons. The reason is that the matter of which all 
of nature surrounding us is built up consists of electrons, pro- 
tons and neutrons. Antiparticles, artificially produced, inevi- 


* Annihilation with the production of three photons, comply- 
ing with the laws of conservation of momentum and energy, is 
also possible. 
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tably meet their much more numerous particles in substances, 
combine with them and cease to exist, giving rise to the produc- 
tion of new particles. 

43.5.8 The difference between neutral particles and their anti- 
particles is established by the nature of the interactions of the 
particles and antiparticles with substances. Examples arc the 


interaction reactions of the antineutrino $v, with protons {p 
and the neutrino $v, with neutrons: 


Wet {p> intsge and gve+in— tp+_fe © 


43.5.9 The difference between electron and muon neutrinos is 
established by investigating the decays of charged pions (43.3.4). 
If the neutrinos produced in these decays are separated out and 
their capture by neutrons is accomplished, then instead of the 
reaction (+) of Sect. 43.5.8, the process will be the following: 


OV + oD => ip +p, 


where u- is the negative muon. This demonstrates the differ- 
ence between the electron and muon neutrinos (and antincu- 
trinos). 


43.6 Structure of the Nucleon 


43.6.1 The structure of the nucleon (or any other elementary 
particle) is understood to mean its extent in space and what it 
consists of. The structure of any elementary particle cannot be 
treated by itself. It is related to the structure and properties 
of other particles. Special experiments for studying their struc- 
ture have been conducted only for nucleons*. Effective methods 
for investigating the structure of nucleons are elastic collisions 
of pions with protons and elastic collisions of fast electrons 
with protons and neutrons. The first method showed that the 
pion is only slightly deflected by the collision from its initial 


* The problem of the structure of elementary particles, as 
well as attempts to establish certain fundamental particles which 
all others are built up of are the frontiers of high-energy physics. 
The present handbook contains only general information on the 
structure of nucleons. 


43—0186 
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direction, and the proton received only slight recoil, so that 
the momentum Ap transmitted to the proton is very small. 
It follows from the indeterminacy relation (38.6.2) that the 
process of the collision of a pion with a proton takes place in 
a certain region of space with the linear dimensions a > hi/Ap, 
where a characterizes the size of the nucleon. 

43.6.2 The emission and absorption of particles and anti- 
particles proceed continuously in a nucleon. A nucleon is treated 
as a complex, incessantly changing 
assembly of many particles. 

At the central part of the nucleon 
(bare nucleon) is the nucleon core 
having a radius of (0.2 to 0.4) X 
x 10-15 m. In this region, heavy 
particles, resonances and nucleon- 
antinucleon pairs, play some spe- 
cial role that is not yet quite clear. 
The outer part of the nucleon 
forms the pion cloud (Fig. 43.4). 
This concept of nucleon structure 
enables the difference in masses of 
the neutron and proton to be 
cleared up, as well as the difference 
of the magnetic moment of the 
proton and neutron from uy. These differences are due to energy 
of electromagnetic interaction between the “core” of the nucleon 
and the pion cloud. 

43.6.3 The scattering of fast clectrons having energies up to 
550 MeV by protons enabled the investigation of electric charge 
density distribution in the proton as a function of the distance r 
from the centre of the “core”. It is necessary here to take into 
account the fact that the charge of the proton is indivisible and 
is always manifested as a unified whole. Therefore, the distribu- 
tion of the electric charge in the proton does not imply the pos- 
sibility of experimentally separating out a definite part of this 
charge. Shown in Fig. 43.5a is the dependence of the radial den- 
sity of charge distribution pg = dq/dr on r (measured in fermi 
units), where dq is the charge contained in a spherical layer with 
the radii r and r + dr. The curve in the figure has a sharply 
defined maximum at the distance R, = 0.77 X 10-18 m, which 
is called the “electric radius” of the proton. The area under the 
curve is equal to the charge e of the proton. The results obtained 


Fig. 43.4 
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in similar experiments in scattering fast electrons by neutrons 
are given in Fig. 43.50. inte 

Scattering proceeds as if the “electric radius” of the neutron 
was equal to zero. In interaction with fast electrons the neutron 


P, fy 


QO Rel ee 
funtt=! fermi=10 ® cm 


(a) 


Fig. 43.5 


(b) 


behaves as if its pion cloud coincides in dimensions with its 
“core”. The electric charge is negative in the inner and outer re- 
gions of the neutron; in the middle region it is positive. The 
total charge of the neutron, equal to the area under the curve, 
is zero. 

The structure of elementary particles is being intensively in- 
vestigated at the present time. 
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PART NINE APPENDICES 


I. SYSTEMS OF UNITS OF PHYSICAL QUANTITIES 


I.4 A unit of a physical quantity is an arbitrarily selected 
physical quantity having the same physical meaning as the one 
being considered. A system of units is the whole set of units, 
pertaining to a certain system of quantities and worked out 
on the basis of accepted rules. The base, or fundamental, units of 
a given system are the units of several dissimilar physical quan- 
tities, arbitrarily selected in establishing the system. The cor- 
responding physical quantities are called base quantities. A sy- 
stem of units is said to be absolute if its base physical quanti- 
ties are length, mass and time. 

Derived units are ones established from other units of the given 
system by means of physical laws that express the relations 
between the corresponding physical quantities. 

1.2 The dimensional formula of any physical quantity is an expres- 
sion characterizing the relations of the units of this physical 
quantity with the base units of the given system of units. This 
expression is a monomial in the form of the product of symbols 
representing the base units raised to the corresponding power 
(whole or fractional, positive or negative). A physical quantity 
is said to be dimensionless if in its dimensional formula all 
base units are in zero power. The numerical value of a dimen- 
sionless quantity is independent of the choice of the system of 
units. 

I.3 Prefixes and their symbols for decimal multiples and sub- 
multiples are listed in Table 1.4. 

1.4 Preferred in most countries today is the International Sys- 
tem (SI), which is based on the application of seven base units: 
metre, kilogram, second, ampere, kelvin, mole and candela, 
as well as two supplementary units, the radian and steradian. 
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The definitions and notation of these units are listed in Tables 
I.2a and I.2b. 

1.5 Derived units of the International System have been worked 
out according to equations relating the physical quantities, 
corresponding to the simplest cases of phenomena and bodies. 
The SI derived units for electric and magnetic quantities have 
been established in accordance with the rationalized form of 
electromagnetic field equations (Sects. 14.2.7 and 23.2.2). 
The most important derived units are listed in Table I.3. 

I.6 In addition to SI units, it is permissible in certain cases 
to use the CGS (for example, in scientific papers in physics). 
The CGS system makes use (within the scope of mechanics) of 
three base units: the centimetre (cm) for length, the gram (g) 
for mass, and the second (s) for time. There are also two supple- 
mentary units: the radian (rad) for a plane angle and the ste- 
radian (sr) for a solid angle. All of these units either coincide 
with the corresponding SI units or are decimal fractions of them: 
1 cm = 102 m and 1 g = 10> kg. 

The derived CGS units for mechanical quantities and their rela- 
tions to SI units are listed in Table 1.4. 

I.7 Three systems of units for electrical and magnetic quanti- 
ties, based on the CGS units for mechanical quantities, are used 
in physics. They are the absolute electrostatic (CGS-electrostatic) 
system, the absolute electromagnetic (CGS-electromagnetic) sys- 
tem and the absolute Gaussian, or symmetrical, system of units. 
In the CGS-electrostatic system, the proportionality factor k 
(14.2.4) in Coulomb’s law is considered to be dimensionless and 
equal to unity. Accordingly, Coulomb’s law and all other rela- 
tions of electrostatics and electrodynamics are not written in 
the rationalized form. 

In the CGS-electromagnetic system, the proportionality factor k 
in the Biot-Savart-Laplace law (23.2.1) is considered to be dimen- 
sionless and equal to unity. Hence, this law and all other rela- 
tions of electromagnetism are written in the unrationalized 
form. The quantity c, indicating the number of units of electric 
charge (or electric current) in the CGS-electrostatic system, is 
equal to one unit of electric charge (or electric current) in the 
CGS-electromagnetic system and is called the electrodynamic 
constant. The electrodynamic constant is equal to the velocity of 
iehi in free space: c = 2.99792458 X 1010 em/s ~ 3 x 10" 
cm/s. 
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TABLE I.1 
Prefixes Used with SI Units 


Prefix Prefix 
Factor Factor 
Name Symbol Name Symbol 
1018 exa E 10-1 (deci) d 
1015 peta P 10-2 (centi) c 
1012 tera T 10-3 milli m 
109 giga G 10-6 micro u 
106 mega M 10-9 nano n 
103 kilo k 10-12 pico p 
102 (hecto) h 10715 femto f 
101 (deca) da 10718 atto a 


Note. The prefixes enclosed in parentheses may be used only in the 
names of decimal multiples and fractions of units that are already 
widely used (for example, hectowatt, decalitre, decimetre, and centi- 
metre). 


TABLE I.2a 
Base Units of the International System (SI) 


Quantity SI units 
Dimen-| Name of | Unit a 
Name sions unit  |symbol Definition 
Length L metre m One metre is 1 650 763.73 


wavelengths in vacuum of 
the radiation correspond- 
ing to the transition 
between electronic ener- 
gy levels 2p,, and 5d, 
of the krypton-86 atom 
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TABLE I.2a (cont.) 


Quantity 
Dimen-}| Name of 

Name sions unit 
Mass M 
Time T second 
Electric cur-| I ampere 
rent 
Thermody- | © kelvin 


namic tem- 
perature 


Unit 
symbol 


kilogram] kg 


A 


679 


SI units 


Definition 


One kilogram is the mass 
of the platinum standard 
kept at the International 
Bureau of Weights and 
Measures at Sèvres, 
France 


One second is the dura- 
tion of 9 192 631 770 pe- 
riods of the radiation 
corresponding to the 
transition between two 
hyperfine levels of the 
ground state of the ce- 
sium-133 atom 


One ampere is the in- 
tensity of constant cur- 
rent which, when main- 
tained in two parallel 
straight conductors of 
infinite length and ne- 
gligible cross section 
placed one metre apart 
in a vacuum, would pro- 
duce between them a 
force equal to 2x10-7 N 
per metre length 


One kelvin is equal to the 
fraction 1/273.16 of the 
thermodynamic tempera- 
ture of the triple point 
of water 
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TABLE I.2a (cont.) 


Quantity 
Dimen-} Name of | Unit 
Name sions unit symbol 
Amount of | N mole mol 
substance 
Luminuous | J candela | cd 
intensity 
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SI units 


Definition 


One mole is the amount 
of substance of a system 
that contains as many 
elementary units (atoms, 
molecules, ions, elec- 
trons, other particles, 
or specified groups of par- 
ticles) as there are atoms 
in 0.012 kilogram of the 
pure nuclide carbon-42 


One candela is the lumi- 
nous intensity, in the 
perpendicular direction, 
of a surface of 1/600 000 
square metre of a black- 
body radiator at the tem- 
perature of freezing pla- 
tinum under a pressure 
of 101325 newtons per 
square metre, 1. C 
101 325 Pa 


Most extensively used in physics is the CGS-Gaussian, or sym- 
metrical, system of units, in which the units for measuring all 
electrical quantities are the same as in the CGS-electrostatic 
system and those for measuring magnetic quantities are the 
same as in the CGS-electromagnetic system. In the Gaussian 
system, as in the CGS-electrostatic system, the proportionality 
factor k (14.2.4) in Coulomb’s law is dimensionless and equal to 
unity, whereas the proportionality factor k in the Biot-Savarlt- 
Laplace law (23.2.1) is taken equal to k = 1/c, where c is the 


electrod ynamic constant (23.2.2). 
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I.8 Derived CGS units (Gaussian system) for electrical quanti- 
ties are listed in Table 1.5, and for magnetic quantities, in 
Table I.6. 

1.9 Off-system, or arbitrary, units, permitted for use in physics 
and astronomy, are listed in Table 1.7. 

1.10 Off-system units permitted for use in conjunction with SI 
units are listed in Table 1.8. 


TABLE I.2b 
Supplementary Units of the International System (SI) 


Quantity | SI units 
Dimen-| Name of | Unit ee 
Name sions unit  |symbol] Definition 
Plane — |radian rad | One radian is the angle sub- 
angle 3 tended at the centre of a 


circle by an arc the length 
of which is equal to the 


radius 
Solid — ‘|steradian] sr One steradian is the solid 
angle angle subtended at the cen- 


tre of a sphere by an area 
of surface equal to the square 
of the radius 
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Derived Units of the CGS System, Mcchanical Quantities 


Quantity 

Name > lone. 
Density L-3M 
Moment of L?M 
inertia 
Momentum LMT- 
Angular mo- | L2MT-2 
mentum 
Force ' LMT-2 
Moment of | L?MT-2 
force 
Impulse LMT-} 
Pressure, me- |L7~!MT~-2 
chanical stress, 
modulus of 
elasticity 
Surface tension| MT~-2 
Work, energy | L2MT~-? 
Power L2MT-3 


Dynamic vis- |L~!1MT~-1 


cosity 
Kinematic 
viscosity 


LT 


Derived CGS unit 


Name 


gram per cu- 
bic centi- 


metre 
gram-centi- 
metre 
squared 
gram-centi- 
metre per 
second 
gram-centi- 
metre 
squared per 
second 

dyne 
dyne-centi- 
metre 
dyne-second 
dyne per 
square cen- 
timetre 
dyne per 
centimetre 
erg 


erg per sec- 
ond 
poise 


stokes 


Expressed in 
SI units 


103 kg/m? 


10-7 kg-m? 


(g-cm)/s |10-5 (kg-m)/s 


10-7(kg/m?)/s 


10-5 N 
10-7 N-m 


10-5 N-s 
10-1 Pa 


10-3 N/m 


10-7 J 
10-7 W 


10-1 Pa-s 


10-* m?/s 
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TABLE I.5 
Derived Units of the CGS System, Electrical Quantities 
Quantity Derived CGS units 
Name Dimensions Name Se 
Electric current L3/2M1/2T-2 — 10/c A 
Electric current density | L~1/2M1/2T-2 — 10°/c A/m2 
Electric charge L3/2M1/2T-1 — 10/e C 
Electric charge density: 
(a) linear L!/2M1/2 T7: — 103/c C/m 
(b) surface L71/2M1/2T-1 — 10°/c C/m? 
(c) volume L73/2M1/2T-1 a 10?/c C/m3 
Polarization L7-1/2M1/2T-1 — 105/c C/m? 
Electric dipole moment | L5/2M1/2T-1 — 4/40¢ C-m 
Pee displacement | L3/2M1/2T~-1 — 10/4ste C 
ux 
Electric displacement L-1/2M1/2T-1 — 10°/4nc C/m? 
Electric potential, volt- | L1/2M+/2T-1 — 1078c V 
age, emf 
Electric field strength | L-1/2M1/2T-1 — 410-6c V/m 
Capacitance L centi- {10°%/c F 
metre 
Electric resistance LoL — 10-%c? Q 
Electric resistivity T — 10-11¢2 Q-m 
Electric conductance LT-! — 109/c? S 
TABLE I.6 
Derived Units of the CGS System, Magnetic Quantities 
Quantity Derived CGS units 
Name Dimensions Name or E i 


Magnetic flux | L3/2M1/2T-1 | maxwell Mx 10-8 Wb 
Magnetic in- | L~!/2M!/2T-1) gauss G 1074 T 
duction 
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TABLE I.6 (cont.) 


Quantity Derived CGS units 

Name Dimensions. Name eat Bes ira s 
Magnetomotive | L!/2M1/2T-1 | gilbert Gb 10/4n A 
force 
Magnetic field | L-1/2M!/2T—1] oersted Oe 103/4n A/m 
strength 
Inductance, L centimetre | cm 10-9 H 
mutual induc- 
tance 
Magnetic mo-| L3/2M1/2T-1 — — 10-3 A-m? 
ment of elec- 
tric current 
Magnetization | L7!/2M1/2T-1 — — | 103 A/m 
Magnetic resis- L-1 — — |109/4n A/Wb 
tance = (reluc- 
tance) 
TABLE I.7 


Off-System Units Permitted for Use in Physics and 
Astronomy 


Off-system units 


Quantity 


Name Symbol Expressed in SI units 


Length astronomical unit| A.U. 1.49600 x 101! m 


light-year It-yr |9.4605x10!5 m 
parsec pe 3.0857 «1016 m 
X-unit X 1.00206 x 10-13 m 

Mass atomic mass unit u 1.66057 X10727 kg 
(unified) 

Area barn b 10-28 m2 

Energy electronvolt eV 1.60219 107-19 J 


1/2 44-0186 
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TABLE I.8 


Off-System Units Permitted for Use in Conjunction With 
SI Units 


Quantity Off-system units 
‘ Expressed in 
Name Name of unit Symbol SI units 
Area (plots of | hectare ha 104 m? 
ground) 
Volume litre l 10-3 m3 
Plane angle degree dse 1/180 rad 
minute wee 1/10 800 rad 
second Stans 1/648 000 rad 
Time minute min 60 s 
hour h 3600 s 
day d 86 400 s 
week W — 
month mo — 
year yr — 
Mass tonne t 108 kg 
Celsius temper- j| Celsius degree °C Celsius tempera- 
ature, temper- ture t= 7—273.15, 
ature differ- where T is the 
ence Kelvin tempera- 


ture. One Celsius 
degree is equal to 
one kelvin 


li. FUNDAMENTAL PHYSICAL CONSTANTS 


¥I.4 Listed in Table II.1 are the fundamental constants com- 
piled under the auspices of the CODATA Task Group on Fun- 
damental Constants. They have been officially adopted by 
CODATA and are taken originally from J. Phys. Chem. Ref. 
Data, Vol. 2, No. 4, p. 663 (1973) and CODATA Bulletin No. 414 
(December 1973). 
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Aberration, geometric, 477 
Absolute zero temperature, unat- 
tainability of, 194 
Absorptance, 514 
Absorption band, 484 
coefficient for elastic waves, 
linear, 410 
factor, 483 
of light, 482 
of light, resonance, 484 
negative, 573 
resonance, 646, 652 
spectrum, continuous, 484 
wave, 410 
Absorptivity, 483 
Acceleration, 27 
absolute, 116 
angular, 32 
average, 27 
centripetal, 29 
Coriolis, 117 
due to gravity, 121 
of free fall, 121, 694 
instantaneous, 27 
linear, 33 
normal, 29 
of pendulum, 392 
relative, 116 
rotary, 33 
tangential, 28 . 
‘of transportation, 117 
Accelerator, linear electrostatic, 
319 


linear resonant, 319 
particle, 318 
resonant cyclic, 319 
Acceptor level, 619 
Action radius, 631 
range, 631 
Active medium, 574 
Activity, 637 
specific, 637 
Adiabat, 148 
Adiabatic equation, 148 
Affinity, electron, 565 
Ammeter shunt, 277 
Ampere, 685, 688 
Ampere’s force, 291 
Ampere’s law, 291 
Amplitude, beat, 377 
of oscillation, 363 
wave, 403 


Analyser, 496 
Angle, position, 30 
of rotation, 30 
Angular frequency of wave, 403 
Angular momentum, about a 
fixed axis, 72 


about a fixed point, 72 

intrinsic, 336 

orbital, 335, 337 

variation law of, 77 
Anharmonicity, coefficient of, 581 
Anisotropy, crystal line, 580 
ea iod; positron-electron, 

67 

Anode, 278 
Antinode, 417 

standing wave, 417 
Antiparticle, 671 
Antiproton, 671 
Anti-Stokes line, 568 
Anti-Stokes luminescent 

tion, 571 

Aperture, numerical, 477 
Apertude diaphragm, 476 
Arm, of couple, 73 

of dipole, 227 


Aston mass spectrograph, 317 
Astronomical unit, 689 
Atom, 554 
donor, 618 
Atomicity of 
280 


radia- 


electric charges, 

Atomic mass unit, 689, 691 

Atomic nuclear stability crite- 
rion, 631 


Atomic plane, 475 

Autophasing, 320 

Average values, 128 

Avogadro constant, 691 

Avogadro’s law, 134 

Avogadro’s number, 133, 6914 

Axis, free, 83 

of rotation, 30 

instantaneous, 32 


Babinet’s theorem, 473 
Balmer-Rydberg formula, 554 
Balmer series, 555 

Barn, 689 

Barometric height formula, 160 


696 


INDEX 


Barrier, potential, 66 
Barrier layer, 622 
effect, 622 
Baryon, 662, 669 
charge, 669 
Basis, orthonormal, 21 
Beat, 377 
Beer’s law, 483 
Billet split lens, 447 
Biot-Savart-Laplace law, 292 
Biot-Savart. Law, see _ Biot-Sa- 
vart-Laplace law 
Biot’s law, 510 
Birefringence, 498 
Blackbody, 514 
Blackness, total degree of, 516 
Blocking layer, 623 
Body, homogeneous, 39 
perfectly elastic, 20 
perfectly inelastic, 20 
perfectly rigid, 20 
of reference, 20 
rigid, 20 
Bohr frequency condition, 557 
Bohr magneton, 337, 692 
Bohr radius, 559, 692 
a e a principle, 


Bohr’s postulates, 557 
Boiling, 218 
Boiling point, 219 
Boltzmann constant, 134, 693 
Boltzmann distribution law, 159 
Boltzmann relation, 188. 
Bond, covalent, 569 
interatomic chemical, 563 
ionic, 564 
length, 564 
metallic, 580 
Bose-Einstein distribution func- 
tion, 590 
Boson, 589 
PONET T eee EATEN 
7 
Bouguer-Lambert law, 482, 574 
Bounta y continuous-spectrum, 
ta) 


law, 


7 
Brackett series, 555 
Bragg’s law, see Wulff-Bragg 
condition 
Branch point, 276, 307 
Brewster’s angle, 438, 496 
Brewster’s law, 496 
Brightness temperature, 523 
Brownian movement, 192 


Candela, 686, 689 

Capacitance, 244 
mutual, 245 

Capacitive reactance, 397 

Capacitor(s), 246 
cylindrical, 247 
parallel-connected, 248 
parallel-plate, 246 
plate, 246 
series-connected, 248 
spherical, 246 

Capillary tube, 216 

Capture, neutron, 652 
resonance, 652 

Carbon-nitrogen cycle, 656 

Carnot cycle, 178 
reverse, 180 


Carnot theorem, 184 
Cathode, 278 
photoelectric, 525 
Cation, 278 
Causality, 88 
Celsius, degree, 690 
scale, 130 
n scale, international, 
Centre, impurity, 617 
of vaporization, 218 
CGS system of units, 677 


ee in momentum, law of, 


Charge, bound, 255 
conjugation principle, 671 
density, linear, 227 
surface, 227 
volume, 227 
distribution, continuous, 226 
discrete, 226 
free, 255 
of nucleus, 626 
Charge-to-mass ratio, 317 
Cherenkov radiation, see Vavi- 
lov-Cherenkov radiation 
Circle, osculating, 22 
Circuit, active portion of, 273 
branched, 275 
magnetic, 306 
passive portion of, 273 
Circulation, of field strength vec- 
tor, 235 
of Mo at induction vector, 
Clapeyron-Clausius equation, see 
Clausius-Clapeyron equation 
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Clapeyron equation, 133 
Classical mechanics, 19 
py age ode equation, 219, 


Clock, paradox, 94 

synchronism, 87 
Closed mechanical system, 38 
Coercive force, 260, 348 
ener aes dimension, 


length, 446 
spatial, 452 
of light, degree of, 450 
spatial, 452 
volume of, 453 
Colliding beam technique, 323 
Collision, 66 
central, 66 
glancing, 66 
head-on, 66 
perfectly elastic, 68 
inelastic, 67 
Collisions, mean number of, 161 
Colour temperature, 524 
Composition of velocities, 97 
Compound pendulum, equivalent 
length of, 370 
Compression, uniaxial, 584 
Compton, effect, 530 
recoil electron, 531 
wavelength, 530 
of electron, 693 
of neutron, 693 
of proton, 693 
Concentration of solution, equiv- 
alent, 28 
Conducting direction, 623 
Conduction, band, 614 
electron, 616 
extrinsic, 618 
heat, 138 
hole, 617 
impurity, 618 
intrinsic, 616 
in semiconductors, 
616 
n-type, 616 
p-type, 617 
unilateral, 622 
Conductivity, electrical, 268 
Configuration, of system, 59 
zero, 60 
Conservation, of angular momen- 
tum, 81 
of baryon charge, 670 
of electric charge, 220 


45—0186 


spatial, 


electrical, 


of energy, 102, 140 
of isotropic spin, 669 
laws of elementary-particle 
physics, 668 
of mass, 39 
of mechanical energy, 63 
of momentum, 46 
of relativistic mass, 99 
of relativistic momentum, 99 
of strangeness, 670 
Conservative system, 55 
Constraint, 37 
Contact layer, 620 
Continuum, four-dimensional 
space-time, 111 
Contrast, degree of, 449 
Conversion, internal, 643 
Cooler, 179 
Corkscrew rule, 290 
Cotton-Mouton, constant, 510 
effect, 509 
Coulomb, 684 
force, 222 
law, 222 
Couple, 73 
Critical angle, 440 
Crystal, biaxial, 498 
covalent, 580 
dextrorotary, 510 
ionic, 580 
lattice, 579 
levorotary, 510 
molecular, 580 
optically positive, uniaxial, 501 
uniaxial, 498 
valence, 580 
Crystallite, 579 
Curie, law, 344 
point, 259, 347 
Current, 265 
carrier, 264 
conduction, 264 
convection, 264 
direct, 265 
electric, 264 
induced, 323 
molecular, 344 
orbital, 335 
photoelectric, 525 
quasi-stationary, 372 
saturation photoelectric, 526 
self-induction, 329 
Curvature, centre of, 22 
Curve, adiabatic, 148 
potential energy, 66 
Cybotaxic region, 208 
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Cycle, 177 
direct, 177 
reverse, 178 

Cyclotron, 319 
frequency-modulated, 321 


Damped oscillations, free, 385 
Damping, logarithmic decrement 


ol, 
of oscillations, 384 
Damping factor, 384 
De Broglie, formula, 536 
wave, 536 
Debye a temperature, 
609 


number, 287 
Debye-Hiickel 
286 

Debye-Langevin formula, 253 
shielding length, 286 
T? law, 610 
Debye-Scherrer-Hull powder pho- 
tograph, 476 
Decay constant 635 
Dee, 349 
De-excitation time, average, 432 
Deformation, 582 
plastic, 583 
relative, 583 
Degeneracy parameter, 593 
Degenerate system, 592 
Degree, 690 
of freedom, 23 
Deionization, 280 
of gas, 289 
Density, 39 
average, 39 
of current heating 
volume, 269 
displacement current, 353 
probability, 538 
radiant flux, 522 
Depletion range, 623 
Derived units, 682 
Detailed balance principle, 515 
Diagram, pT, 587 
Diamagnetic material, 340 
Diatomic gas molecule, 163 
Dichroism, 504 
Dielectric, 249 
constant, 222 
absolute, 224 
complex, 490 
nonpolar, 249 


screening radius, 


capacity, 


polar, 250 
polarized, 252 
Diffraction angle, 467 
circular aperture, 466, 469 
convergent ray, 465 
grating, constant, 470 
one-dimensional, 470 
space, 474 
light, 465 
minimum, principal, 471 
narrow long slit, 467 
parallel ray, 465 
small disk, 467 
spectrum, 472 
Diffusion, 170 
coefficient, 171 
Dimension, proper, 91 
formula, 676 
Dipole, moment, permanent, 250 
quasi-elastic (induced), 250 
rigid, 250 
Discharge, gas, 283 
Dispersion, of light, 487 
rotary, 510 
of waves, 411i 
Displacement, electric, 231 
magnetic, 290 
static, 390 
Dissipative system, 64 
Dissociation, degree of, 280 
Distinguishability of identical 
particles, 151 
Distribution, law, free path, 160 
molecular velocity, 154 
Disturbance, mechanical, 399 
Domain, 258, 348 
Donor level, 618 
Doppler effect, 420, 440 
longitudinal, 442 
transverse, 443 
Doppler spectral line 
ing, 443 
Dose rate of radiation, 644 
Double-magic number, 631 
Drift, 2814 


broaden- 


Drude-Lorentz electron theory, 
267 

Dulong-Petit law, 605 

Dynamic viscosity coefficient, 
172 


Dynamics, 19 
relativistic, 100 
of rigid body rotating about 
fixed axis, fundamental law 
of, 79 
Dyne, 687 
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kilfective cross section, 649 
isigenfunction, 540 
Kiinstein’s, general theory of rel- 
ativity, 111 
photoelectric emission equation, 
527 


relativity principle, 35, 85, 441 
Elastic body, 398 l 
hilasticity, coefficient of, 583 

degree of, 583 

modulus of, 584 

of volume, 398 
Elastic limit, proportional, 583 
Elastic wave, longitudinal, 400 

sinusoidal, 402 

transverse, 400 
Electric charge, negative, 220 

point, 222 

positive, 220 
Electric constant, 224 
Itlectric dipole, 230 

moment, 230 

induced, 249 
Electric displacement flux, 2314 
Electric energy source, 265 
PECS fields, superposition of, 

227 

Electric layer, double, 620 
Electric radius of proton, 674 
Electrochemical equivalent, 278 
Electrode, 278 
Electrodynamic constant, 677 
Electrodynamics, 264 

principal problem of, 350 
Electrolytic dissdciation, 278 
Electromagnetic induction, 323 
ltlectromagnetic system, absolute, 

676 


kiiectromagnetic wave, 422 
intensity of, 429 
pressure of, 430 

Electromotive force, 272 

Electron, 220 
charge to mass ratio, 692 
collective, 267 
conduction, 267, 616 
conversion, 643 
optical, 481 
magnetic moment, 692 
radius, classical, 692 
rest energy, 694 

mass, 691 
Shell, 562 
temperature, 289 


45% 


valence, 563 
lilectron-hole junction, 624 
Electronvolt, 689 
ltlectrostatics, 220 
Electrostatic system, 

677 
Electrostriction, 223 
ltlementary charge, 691 
Elementary particles, 660 
mf, 272 
Emissive power, 514 
_ total, 543 
Emittance, radiant, 513 
Energy, 54 

activation, 209, 616 

atomic unit of, 629 

band, 6114 

binding, 555 

bound, 188 

in ore force field, potential, 


absolute, 


of charged capacitor, 261 
of charged conductor, 
sic, 260 

of currents, mutual, 334 

density, volume, 262, 333, 427 
kinetic, 407 
potential, 408 

dissipation of, 64 

dissociation, 564 

efficiency, photoluminescent, 

571 

eigenvalues of, 540 

of elastic body, potential, 62 

of electrostatic field, 261 

cquipartition of, 164 

flux, 409 

free, 187 

internal, 135 

intrinsic, 332 

kinetic, 56, 367 

level, 543 
impurity, 617 
quasi-continuous, 543 
width, natural, 611 

magnetic field, 332 

mutual potential, 196 

nuclear reaction, 649 

of nucleon, binding, 628 

potential, 59, 367 

quantum, 519 

rest, 102 

source, electric, 271 

spectrum of particle, 540 

of system, binding, 104 


intrin- 


700 


Energy, total, 134 
mechanical, 63 
transfer velocity, 409 
of translational motion, average 
kinetic, 154 
of two particles, 
tential, 62, 108 
unavailable, 188 
in uniform field, potential, 60 
zero-point, 545 
vibrational, 168 
Entropy, 186 
Equation, in dynamics of transla- 
tion, basic, 44 
of forced electrical oscillation, 
differential, 394 
of forced oscillation, differen- 
tial, 388 
of state, calorific, 136 
of state, thermal, 131 
Equations of motion, 21 
Equilibrium, mechanical, 64 
Equilibrium state, 129 
Equipotential surface, 242 
Equivalence principle, 123 
local, 124 
Erg, 687 
Euler formula for complex num- 
bers, 365 
Sxchange interaction, quantum- 
mechanical, 565 
Excited state, 555 
Evaporation, 217 
\ixistance, radiant, 514 
Expansion, thermal, 580 
Expansion coefficient, 580 
Exponent, adiabatic, 148 
Exposure dose rate, 644 
Extension, longitudinal, 584 
ixternal body, 38 
Extraordinary ray, 499 


mutual po- 


Fabry-Perot etalon, 457 
Fajans-Soddy law, 636 
Farad, 685 
Faraday, constant, 692 
effect, 5414 
law of electromagnetic 
tion, 323 l 
laws of electrolysis, 278, 279 
number, 692 
Fermi, constant, 663 
energy, 595 
level, 596 


induc- 
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Fermi-Dirac distribution func- 
tion, 594 
Fermions, 590 
Ferroelectric material, 257 
Ferromagnetic material, 346 
Fick’s law, 170 
Field, 36 
of 7 forces, conservative, 
conservative nonstationary, 55 
stationary, 54 
electromagnetic, 225 
electrostatic, 224 
force, 224 
gravitational, 107 
induced electric, 323 
of interference, 447 
magnetic, 290 
physical, 36 
potential steady, 54 
radiation, 430 
rotational, 305 
stationary, 36 
steady-state electric, 224 
strength, electric, 225 
magnetic, 294 
uniform electric, 225 
Fine-structure constant, 664, 691 
First radiation constant, 694 
Fission parameter, critical, 654 
Fluctuation, 189 
absolute, 190 
mean square, 190 
relative, 190 
Fluidity, 208 
Fluorescence, 570 
Flux, 305 
density, magnetic, 290 
linkage, 310 
of NEUE induction vector, 
Focusing, axial, 353 
double, 318 
radial, 353 
strong, 322 
Focusing condition, weak, 322 
Forbidden band, 612 
Force(s), 35 
active, 37 
arm of, 70 
of attraction, 195 
induction, 198 
orientational, 197 
central, 221 
centre of, 60 
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lorce(s), centrifugal, 118 
centripetal, 42 
coercive, 260, 348 
conservative, 53 
Coriolis inertial, 118 
Coulomb, 222 
couple, 73 
dispersion, 198 
dissipative, 55 
disturbing, 388 
driving, 388 
elastic, 583 
electromotive, 272 
external, 38 
extraneous, 269 
field, conservative, 235 
function, 55 
gravitational, 104 
gravity, 120 
gyroscopic, 55, 119 
impact, 66 
impulse of, 40 
independence of, 42 
inertial, 118 
internal, 38 
Lorentz, 310 
magnetic, 311 
magnetizing, 306 
magnetomotive, 306 
motive, 52 
nonconservative, 63 
nuclear, 631 
potential-field, 53 
quasi-elastic, 366, 584 
reactive, 44 
of repulsion, 195 
of resistance, 52 
resultant, 36 
retarding, 52 
short-range, 195 
steady, 59 
of transportation, inertial, 118 
van der Waals, 197 

Fourier heat conduction law, 173 

Frame of reference, 20 
chronometry of, 85 
heliocentric, 35 
inertial, 35 
laboratory, 35 
noninertial, 115 

Fraunhofer diffraction, 465 

Free body, 37 

Freedom, degree of, 162 

Free fall of a body, 124 

Free rotation, 83 

Freezing point, 587 
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Frenkel’s hole theory, 208 
Frequency, angular, 363 
beat, 377 
basic angular, 378 
of oscillation, 363 
plasma, 371 
resonance, 391 
of vibration, natural, 419 
wave, 402 
Fresnel 
biprism, 447 
diffraction, 465 
formulas, 438 
mirror principle, 446 
zone, 464 
Fringe(s) 
equal-inclination, 455 
equal-thickness, 456 
interference, 448 
Fundamental physical constants, 


Fusion, 585 


Galilean transformation, 48 
wanes principle of relativity, 


Gamma radiation, 435 
Gamma ray, 642 
Gas, dilute, 175 

electric breakdown of, 285 

electron, 242 

ideal, 132 

rarified, 175 

real, 194 

van der Waals, 136, 200 
Gas constant, 133 

specific, 133 

universal, 134 
Gas discharge, 283 

self-maintained, 284 

semi-self-maintained, 283 
Gauss, 688 

theorem, see 

Gauss theorem 

Gaussian system, absolute, 677 
Generator. coherent light, 576 
Gilbert, 689 
Grain, crystal, 579 
Gram equivalent, 279 
Gravitational constant, 105, 694 
Gravitational red shift, 648 
Gravitation, relativistic theory 


Ostrogradsky- 


of, 141 
Gravity, zero, 123 
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Grazing angle, 475 
Greybody, 515 
Ground state, 555 
Group of bands, 566 
Gyromagnetic ratio, nuclear, 628 
of orbital magnetic moments, 
336 


of spin moments, 336 
Gyroscope, 77 
- balanced, 78 
regular precession of, 77 
symmetrical, 77 


Hall effect, 314 
Harmonic oscillator, 
Harmonics, 379 
quantity of, 137 
reduced quantity of, 185 
of vaporization, 219 
of vaporization, internal, 219 
of pane oe specific latent, 


Heat 
capacity, 143 
molar, 143 
quantum theory of, 165 
specific, 143 
death, 189 
exchange, 138 
convective, 138 
radiative, 138, 512 
flow, specific rate of, 173 
of fusion, 586 
latent, 586 
Heat sink, 179 
Heat source, 179 
Heater, 179 
Hectare, 690 


linear, 368 


Heisenberg indeterminacy rela- 
tions, 548 

Henry, 685 

Hertz, 682 


High-energy physics, 662 
Hologram, 478 
Holography, 478 
Hooke’s law, 583 

for volume deformation, 399 
Hopkinson formula, 306 
Humphreys series, 555 
Huygens-Fresnel principle, 463 
Huygens principle, 
pees of energy bands, 


Hyperon, 662, 668 


INDEX 


Hysteresis, dielectric, 259 
magnetic, 347 
Hysteresis loop, 348 


Impact, line of, 66 
oblique, 66 
Impedance, 397 
of oscillatory circuit, 373 
Impulse of force, 40 
Incidence, of light, inclined, 472 
plane of, 436 
Indistinguishability of 
particles, 589 
Induction, clectric, 2314 
magnetic, 290 
mutual, 330 
Inductive reactance, 397 
Inertia, centre of, 39 
law of, 34 
principal axis of, 76 
Infrared radiation, 435 
Initia] instant, 23 
Instantaneous acceleration, 27 
Instantaneous power, 56. 398 
Tnstantaneous velocity, 24 
Intensity, electric, 226 
gravitational field, 107 
of ionization, 283 
of light, 429 
magnetization, 340 
radiant, 522 
wave, 410 
Interaction 
coupling constant, 663 
direct, 650 
electromagnetic, 664 
nuclear strong, 628 
quantum-mechanical 
565 
strong, 664 
time constant of, 663 
weak, 
Interference 
aperture angle of, 447 
wave, 414 
fringe, 448 
maximum, order of, 445 
minimum, 
Internal friction, 171 
International System (SI), 678° 
Tnterplanar spacing, 475 
Interval,. spacelike, 95 
space- -time, 94 
timelike, 95 


identical 


exchange, 
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Inverted state, 575 


Ton 
drift, 281 
hydrogen-like, 554 
isoelectronic, 554 
Ionization, of atom, 282, 555 
collision, 283 
degree of, 286 
intensity of, 283 
surface, 285 
volume, 285 
Tonosphere, 286 
Trradiance, 522 
Tsobar, 626 
Tsochromate, 508 
Tsolated mechanical system, 38 
Tsopathic fringes, 455 
Tsoprocess, 132 
Tsospin, 669 
Tsotherm, 148, 202 
critical, 202 
Tsotope, 626 
Tsotope effect, 601 


Joule, 683 

Toule-Lenz law, 269, 275 
Joule’s law, see Joule-Lenz law 
Junction, 276, 622 


Kaon, 668 
Kelvin, 130, 679 
Kepler’s laws, 1413 
Kerr 
constant, 509 
effect. electrooptical, 509 
law, 509 
Kilogram, 679 
kusmai. viscosity coefficient, 
i7 
Kinematics, 19 
relativistic, 97 
REG theory, 
of, 
of gases, 151 
Kirchhoff’s function, 545 
Kirchhoff’s law for radiation, 515 
Kirchhoff’s laws, 276 
for aS magnetic circuits, 
7 
K meson, 668 
Knudsen effect, 176 
Koenig’s theorem, 58 


basic equation 


Lambert law, 482 
Langevin-Debye formula, see De- 
bye-Langevin formula 
Langmuir oscillations of plas- 

ma, 374 


Langmuir plasma frequency, 371 
Larmor precession, angular ve- 
locity of, 338 


Larmor’s theorem, 338 
Laser, 576 
Lattice constant, 579 
Lattice plane, 475 
Lauc condition, 474 
Lauc diffraction pattern, 476 
Launching mass, 45 
Left-hand rule, 2914 
Lenz law, 324 
Lepton, 662 
Lifetime, average, 635 
mean proper, 92 
Light, 433 
natural, 495 
optical path of, 453 
partially polarized, 495 
pressure, 529 
reflection law, 462 
refraction law, 462 
visible, 435 


Light-year, 93, 689 
Line of action, 36 
Lines, of force, 226 
of magnetic force, 290 
Linewidth, natural, 645 
Liquid, 207 
boiling, 202 
nonwetting, 2413 
superheated, 205 
wetting, 213 
SAE OE model of nucleus, 
Lissajous figures, 383 
Litre, 690 
ea ale Gerald contraction, 


Lorentz 

force, 310 

generalized, 313 

formula, 3413 

invariance, 90 

transformation, 89, 442 
Lumen, 686 
Luminance temperature, 523 
Luminescence, 570 
Luminophor, 570 
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Lux, 686 
Lyman series, 555 


Macrocurrent, 344 
Magic number, 631 
Magnetic constant, 293 
Magnetic field, 290 
internal, 344 
uniform, 291 
Magnetic flux, 305 
of mutual induction, 331 
quantum, 692 
of self-induction, 327 
ees ar interaction, force of, 
Magnetic material, soft, 348 
retentive, 348 


Magnetic moment, of closed 
loop, 297 
nuclear, 627 
orbital, 335, 337 
spin, 336 
relative, 


Magnetic permeability, 
293 


Magnetic resistance, total, 307 
Magnetic rotation, 511 
Magnetization, remanent, 348 
saturation, 344, 348 
uniform spontaneous, 349 
Bina e ocsnamic generator, 


Malus law, 496 
Maser, 576 
Mass 
centre of, 39 
defect, 104, 629 
dependence on velocity, 99 
energy relation, 102 
flow, specific, 170 
initial, 45 
molar, 133 
number, 626 
of a particle, 38 
rest, 99 
spectrometry, 317 
transfer, 137 
Material point, 20 
Maximum, order of, 459 
subordinate, 458 
Maxwell, 688 
rule, 290 
Maswi Boltzmann, distribution, 


law, 160 


Maxwellian distribution, 154 
Maxwell’s cqual area rule, 205 
Maxwell’s equations, 351 
complete set of, 358 
Maxwell’s theory, 350 
Mayer equation, 146 
Mean free path, 164 
Measurement, 550 
Mechanical motion, 19 
Mechanical system, 20 
Mechanics, 19 
quantum, 19, 535 
statistical, 128 
Medium, continuous, 400 
dispersive, 411 
isotropic, 401 
linear, 404 
uniform, 4014 
Melting, 585 
point, 585 
Mende tcy-Glapeyzan equation, 
Mendeleev’s periodic system, 562 
Meniscus, 213 
Mesh rule, 276 
Meshchersky equation, 44 
Meson, 662 
Metal, 580 
Metre, 678 
Metric, i111 
MHD generator, 287 
Microcurrent, 344 
Microfield, averaged, 351 
Mie effect, 486 
Minimum, additional, 471 
Mmf, 306 
Mobility, ion, 284 
of molecule, 211 
Modulation, amplitude, 380 
frequency, 380 
phase, 380 
Modulus, of elasticity, 584 
of elasticity, bulk, 399 
shear, 585 
Molar gas constant, 693 
Molar volume of ideal gas, 693 
Mole, 680 
seen action, range of, 127, 


sphere of, 200 
Molecular-kinetic theory, 126 
Molecule, 563 

atomic, 565 

diatomic gas, 163 

effective diameter of, 132 

heteropolar, 564 


INDEX 


Molecule 
homopolar, 565 
ionic, 564 
ionization of, 282 
monatomic gas, 162 

Moment of force, 

axis, 71 
about fixed point, 70 

Moment of inertia, 74 
principal, 77 

Moment, resultant, 70 

Moments of inertia of regular 

bodies, 76 

Momentum, 40 
angular, 72 
moment of, 72 
relativistic, 99 

Monatomic gas molecule, 162 

Monocrystal, 579 

Mosley’s law. 572 

Mossbauer effect, 647 

Motion 
absolute, 115 
accelerated. 25 
aperiodic, 388 
of centre of mass, 43 
curvilinear, 22 
decelerated, 25 
differential equatfon of, 42 
equations of, 21 
in frame of reference fixed to 

earth, relative, 119 
of free rigid body, 80 
independent, :23 
mechanical, 19 
plane, 22, 34 
rectilinear, 22 
relative, 116 
rotary, 30 
thermal, 126 
translational, 29 
uniform, 25 
uniformly accelerated, 28 
decelerated, 29 
variable, 28 

Mu meson, 665 

Muon, 93, 665 
antineutrino, 665 
neutrino, 665 

Mutual induction, 330 


about fixed 


Nernst heat theorem, 193 
Nernst principle, 194 
Neutrino, 97 

Neutron, 626 
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fission, 654 

multiplication factor, 654 

prompt, 654 

rest energy, 694 

rest mass, 692 

slow, 652 

thermal, 652 
Newton, 684 

ring, 456 
Newtonian mechanics, 19 
oe first law of dynamics, 

ur universal gravitation, 

second law, 40 

third law, 42 

viscosity law, 172 
Node, standing wave, 417 

of wave function, 545 
Nonconservative system, 55 
Nonwetting liquid, 213 
Normal, principal, 22 
Nozzle velocity, 46 
N-type conduction, 616 
Nuclear fission, spontaneous, 654 
Nuclear forces, exchange inter- 

pretation of, 633 

saturation in, 632 
Nuclear, magneton, 627, 693 

power engineering, 654 

radius, 627 

reaction, 648 

reactor, 654 

recoil, 646 

time, 663 
Nucleon, 626, 662 

number, 626 

structure, 673 
Nucleus, 626 

compound, 650 

daughter, 635 

mirror, 631 

parent, 635 


Object beam, 478 
Oersted, 689 
Ohm, 685 
Ohm’s law, for closed magnetio 
circuits, 306 
for current density, 268, 598 
of current density in electro- 
lytes, 281 
in differential form, 268 
generalized, 274 
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Optical axis, 498 
Optical homogeneity, 475 
Optically active substance, 510 
Optical-path difference, 453 
Optical pyrometer, 522 | 
Optical radiation, 435 
Optics, quantum, 524 
Orbit quantization rule, 557 
Order in liquids, short-range, 208 
Order, of principal maximum,471 
Ordinary ray, 499 
Oscillation(s), 362 
addition of, 374 
antiphased, 376 
circularly polarized, 382 
coherent, 376 
cophased, 376 
electromagnetic, 362, 374 
electromechanical, 362 
elliptically polarized, 381 
forced, 362 ; 
free, 362 
free damped, 385 
harmonic, 363 
linearly polarized, 382 
mechanical, 362 
quasi-periodic, 380 
Oscillator, 430 
linear harmonic, 431, 543 
strength, 492 
Oscillatory circuit, 372 
Ostrogradsky-Gauss theorem, 233 
for magnetic fields, 306 


Packet, wave, 411, 445 
Paramagnetic material, 342 
Parsec, 689 
Particle(s), 20 
elementary, 660 
steady state of, 540 
Pascal, 683 
Paschen series, 555 
Path, 22 
difference, 415 
length, 22 
Pendulum. compound, 368 
simple, 370 
spring, 368 
Penetrability factor, 551 
Perfectly, elastic body, 20 
inelastic body, 20 
rigid body, 20 
Period, beat, 377 
of oscillation, 363 


of revolution, 32 
PER ete absolute magnetic, 


of free space, 293 
relative magnetic, 293 
Permittivity, absolute, 224 
of free space, 224 

relative, 222 
Perpetual motion 
first kind, 141 
of second kind, 183 
Pfund series, 555 
Phase, 302 
of oscillation, 363 
of plane wave, 403 
Phase, stability principle, 320 
transition, first-order, 219 
second-order, 205 
Phase velocity, 406, 423, 536 
Phasotron, 321 
Phenomenological 
mics, 129 
Phonon, 603, 608 
Phosphorescence, 570 
Photoconduction, 525 
Photoelasticity, 508 
ts a effect, barrier-layer, 
in gases, 525 
internal, 525 
multiple-photon, 528 
threshold, 526° 
Photoelectron, 525 
Photo-emf, 525 
Photoemission, 525 
Photoionization, 525 
Photoluminescence, 571 
quantum yield of, 571 
Photon, 97, 527 
mass of, 528 
momentum of, 529 
Physical quantity, quantized, 542 
Physics, classical statistical, 154 
high-energy, 662 
molecular, 126 
statistical, 128 
Pi meson, 666 
Pion, 666 
triplet, 669 
Planck 
constant, 521, 691 
quantum hypothesis, 519 
radiation formula, 521 
Plane, osculating, 22 
of polarization, 426 


machine, of 


thermodyna- 
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Plane-parallel motion, 34 
Plasma, 286 
fully ionized, 286 
gas-discharge, 289 
gascous, 287 
high-temperature, 286 
isothermal, 288 
weakly ionized, 286 
Pleochroism, 
P-n junction, 624 
Point, lattice, 127 
Point rule, 276 — 
Poise, 687 
Poisson 
equation, 148 
spot, 467 
Polarizability, 249 
Polarization, 252 
charge, surface-bound, 254 
current density, 355 
electronic, 252 
ionic, 252 
of light, 495 
orientational, 252 
remanent, 259 
spontancous, 258 
of eee sinusoidal waves, 
4 
Polarizer, 495 
Polarizing prism, 503 
Pole, 70 : 
Polycrystal, 579 
Population inversion, .575 
Positive hole, 
Positron, 220 - 
Postulate of special 
relativity, 84, 
Potential 
barrier, 551 
chemical, 590 
difference, contact, 621 
internal contact, 621 
electric field, 234 
electrostatic field, 238 
of gravitational field, 109 
ignition, 285 
ionization, 282, 555° 
jump, surface, 620 
retarding, 525 
well, 66, 5414 
Power, 56 
active, 398 
average, 56 
factor, 398 
Poynting vector, 428 


theory of 


Precession, regular, 77 
Prefixes used with SI units, 678 
Pressure, 129 
internal, 201, 212 
Probability density, 538 
Process, adiabatic, 132 
equilibrium quasi-static, 131 
irreversible, 182 
isobaric, 132 
isochoric, 132 
isothermal, 132 
isovolumic, 132 
nonequilibrium, 132 
thermodynamic, 131 
Product of inertia, 75 
Proton, 220, 626 
magnetic moment, 693 
proton chain, 655 
proton doublet, 669 
rest energy, 694 
rest mass, 691 
Pseudovector, 30 
Psi function, 538 
P-type conduction, 617 
P-wave, 437 
Pyrometry, optical, 522 


Q factor, 387 - 
Quadrupole moment of. nucleus, 
628 


Quality factor, 387° >- z 
Quantum, of circulation, 692.: 
of energy, 520 ` 
mechanics, 19, 535 
number, 543 
magnetic, 559 
orbital, 558 
principal, 555 
spin, 560 
spin magnetic, 560 
optics, 
states, number of, 589 
statistics, 588 
theory of electrical: conduction 
in metals, 598 
Quark, 663 
Quarter-wave plate, . 506 
Quasi-momentum, 608 
Quasi-particle, 608 


Radian, 681 
Radiance, 522 


708 


Radiant flux, 522 
Radiant flux density, 522 
Radiating system, 430 
Radiation, anti-Stokes 
cent, 571 
dose rate of, 644 
absorbed, 645 
equilibrium, 512 
isotropic, 513 
gamma, 435 
infrared, 435 
luminescent, 570 
optical, 435 
pyrometer, 522 
stimulated, 573 
temperature, 523 
thermal, 511 
ultraviolet, 435 
Vavilov-Cherenkov, 430, 492 
X, 435 
Radiator, cosine, 523 
Radioactive decay law, 636 
Radioactive family, 637 
Radioactive half-life, 636 
Radioactivity, 633 
artificial, 635 
induced, 635 
natural, 635 
Radio-frequency bands, 434 
Radius of curvature, 22 
Raman 
effect, 568 
scattering of light, 568 
spectrum, 569 
Rationalized equations, 293 
Rationalized law, 224 
Ray, 402 
incident, 436 
reflected, 436 
refracted, 436 
Rayleigh-Jeans law, 518 
Rayleigh 
criterion, 477 
scattering, 485 
Reactance, 397 
Reaction, chain fusion, 655 
constraint, 37 
endothermic, 649 
exothermic, 649 
fission, 653 
nuclear, 649 
thermonuclear, 657 
Reaction, channel, 649 
cross section, 649 
Recombination, 282 
Rectifying effect, 622 


lurmines- 


INDEX 


necu ey light propagation law, 


Reference, beam, 478 
frame, 21 
Reflection, angle of, 436 
coefficient, 439 
law of, 437 
total internal, 440 
Refraction, double, 498 
law of, 437 
Refractive, angle, 436 
index, absolute, 435 
complex, 483 
relative, 436 
Relationship, cause-and-effect, 88 
aay dynamics, basic law 
ol, 
effect, 84 
invariance, 90 
mechanics, 19 
theory, 84 
PERUN mechanical principle 
of, 
principle of, 85 
special theory of, 84 
theory, restricted, 84 
Relaxation time, 209 
Releasability, principle of, 37 
Reluctance, 307 
Resistance, 273, 397 
additional, 277 
coefficient, 385 
total magnetic, 307 
Resistivity, electric, 268 
residual, 601 
Resolution, 477 
limit, angular, 477 
Resolving power, 477 
Resonance 
curve, 391 
of oscillatory circuit, 399 
mechanical, 391 
particle, 663 
Resonon, 663 
Right-hand screw rule, 290 
Rochelle salt, 259 
Koenen equivalent man (rem), 


Roentgen rays, 435 
Rotation, 30 
angle of, 30 
about fixed axis, 32 
free, 83 
specific, 510, 511 
uniform, 31 
Rotational speed, 32 
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Rotatory power, 510, 511 
Rule, right-hand screw, 31 
Rydberg constant, 554, 692 


Satellite line, 569 
Scattering, combination, 568 
of light, 
molecular, 485 
Schrödinger, steady-state equa- 
tion, 557 
vae pendent wave equation, 


wave function, 538 
Screens, complementary, 473 
Seat of emf, 
Second, 683, 687 
Second radiation constant, 694 
Seignette’s salt, 259 
Self-inductance, 328 
Self-induction, 327 
Semiconductor(s), 615 

diode, 625 
Series limit, 555 
Shear, 

modulus, 585 

plane, 585 

strain, unit, 585 
Siemens, 685 
Sign rule for emf, 274 
Sites, lattice, 127 
Snell’s law, 437 
Soddy’s displacement law, 636 
Solenoid, 300 
Solid, 579 

amorphous, 579 

solution, 588 
Sonic waves, phase: velocity of, 


Sound, audible, 400 
Sources, coherent, 414 
Space 
charge layer, 623 
homogeneity of, 46 
isotropy of, 82 
lattice, 579 
quantization, 559 
Spectroscopy, nuclear, 647 
Spectrum, band, 
continuous absorption, 484 
diffraction, 473 
dispersion, 487 
of electrons, band energy, 611 
electron-vibrational, 568 
frequency, 379 


light absorption, 483 

line X-ray, 572 

mass, 317 

molecular, 566 

oscillation, 379 

rotation, 567 

rotational, 32 

vibrational, 567 
Spin, electron, 336 

isotopic, 669 

nuclear, 627 

quantum number, inner, 627 
State 

function, 132 

inverted, 575 

stable, 65 

of ue namig equilibrium, 


unstable, 65 
Statics, 19 
Statistical laws, 128 
Steady, field, 

state, 129 

postulate, 557 


T constant, 516, 


law, 516 

Steiner’s theorem of 
axes, 75 

Steradian, 681 
Stokes, 687 

line, 568 

theorem, 356 
Stoletov law, 526 
Storage factor, 387 
Strain, unit, 583 
Strangeness, 670 
Strength, magnetic field, 345 

ultimate, 584 


parallel 


Stress, 583 

ir a diagram, tensile, 
Sublimation, 585 

Subshell, 562 

Superconductivity, 601 
Superfluidity, 206 

Superposition of fields, prin- 


ciple of, 108 
of forces, 42 
Surface, equipotential, 242 
tension, 
wave, 402 
of wave, ray, 500 


Surface-active substance, 213 
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Susceptibility, complex dielectric, 


490 

dielectric, 253 

electric, 253 , 

magnetic, 342 
Suspension, centre of, 78 

point of, 369 
S-wave, 437 
Synchrocyclotron, 321 
Synchrotron, 321 
System, linear, 384 

of particles, 20 


Target nucleus, 649 
Tautochronous paths, 453 
Temperature, 129 
absolute, 130 
absolute zero, 130 
scale, absolute, 130 
empirical, 130 
thermodynamic, 150, 184 
characteristic, 607 
colour, 522 
critical, 202 
Debye characteristic, 609 
degeneracy, 593 
- electron, 289 
luminance, 523 
of solidification, 587 
Temperature- indicating body, 130 
Tensile strength, 584 
Term, 
Tesla, 685 
Test charge, electric, 225 
Thermal conductiivty, 173 
Thermal efficiency, 179 
Thermodynamic method, 128 
Thermodynamic, probability of a 
state, 188. 
processes, 150 
reversible, 181 
system, 129 
adiabatically isolated, 140 
closed, 139 
thermally isolated, 140 
Thermodynamics, 129 
first law of, 140 
phenomenological, 129 
second law of, 183 
third law of, 193 
Thermonuclear chain, 655 
Thomson formula, 373 
Three-level procedure, 576 


Time 

coherence, of nonmonochroma- 

tic waves, 446 
vibrations, 449 P 

dilation, 92 

proper, 92 

reference, zero, 63 

relaxation, 209 

slowing of, 92 

uniformity of, 63 


Tonne, 690 
Top, 78 
Toroid, 299 
Total current law, for a mag- 
netic field, 344 
in vacuum, 304 


Transformation, Galilean, 48 
Lorentz, 89, 442. 
Transistor, 625 
Transition, elements, 563 
temperature to superconducting 
state, 6014 


Translation, 29 

Transmission factor, 440 

Transmittance, 440 

Transport, equations, 174 
phenomena, 170 z 

Transuranic chemical cłements, 

Trapping level, 619 

Travelling speed, 25 

Travelling wave equation, 401 

Triode, crystal, 625 

Tsiolkovsky’s formula, 45 

Tunnel effect, 554 

Twin paradox, 94 

Two-fluid theory, 206 

Tyndal effect, 485 


Ultimate strength, 584 
Ultraviolet, catastrophe, 518 
radiation, 435 


Umov-Poynting vector, 428 
Umov vector, 41 
Unit, base, 678 
derived, 682 
fundamental, 676 
off-system, 689 
of physical quantity, 676 
Units, system of, 676 
Universal gravitation, 
of, 


constant 
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Vacuum, 175 
degrees of, 175 
high, 175 
Van de Graaff electrostatic gene- 
rator, 319 
Van der Waals coefficient, 136 
Vaporization, 217 
Vapour, 194 
dry saturated, 202 
supersaturated, 205 
wet, 202 
Variable, external state, 130 
internal, 131 a 
state, 129 
thermodynamic, 129 
Variation, harmonic, 365 
Vavilov-Cherenkov radiation, 430, 
492 


Vavilov’s law, 571 
Vector, axial, 30 
current density, 266 
displacement, 23 
of external forces, principal, 43 
flux of magnetic induction, 305 
magnetic induction, 290 
magnetization, 340 
nonlocalized, 37 
polar, 30 ‘ 
polarization, 252 s 
potential gradient, 241 
Poynting, 428 
radius, 21 
Umov, 410 
Umov-Poynting, 428 
Vector diagram method, 365 
Velocity, 24 
absolute, 116 
angular, 30 
areal, 26, 113 
arithmetic mean, 157 
average relative, 159 
bulk, 116 
circular orbital, 114 
drift, 116 
energy transfer, 409 
escape, 114 
exhaust gas,- 45 
group, 413 
instantaneous, 24 
of light, invariance of, 85 
linear, 314 
most probable, 156 
of multistage (combination) 
rocket, characteristic, 45 
orbital, 114 
parabolic, 144 


of pendulum, 391 
phase, 406, 423, 536 
radial, 26 
relative, 116 
of rocket, characteristic, 45 
solar escape, 115 
of sound, 406 
of source, ray, 422 
space, 112 
spin, 78 
of transportation, 116 
transverse, 26 
wave, 404 
of wave, ray, 500 
Verdet constant, 511 
Vibration, 362 
isochronous, 370 
zero, 546 
Vibrator, elementary, 430 
Viscosity, 171, 172 
Visibility of fringes, 449 
Visible radiation, 435 
Volt, 684 
Voltage, 272 
breakdown, 248, 285 
drop, 272 l 
retarding, 525 
source, 394 
voltmeter, 277 
Volume, molar, 133 
specific, 129 


Watt, 683 

Wave(s), acoustic, 399 
centre, 404 
coherent, 414 

partly, 449 

circularly polarized, 426 
clastic, 399 
electromagnetic, 422 
elliptically polarized, 426 
equation, 405 
extraordinary, 499 
incindent plane, 436 
length of standing, 417 
linearly polarized, 426 
matter, 536 
mechanics, 535 
monochromatic, 425 
ordinary, 499 
plane, 403 | 
plane monochromatic, 425 
plane quasi-sine, 412 
radio, 433 
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Wave(s) displacement law, 517 
reference, 478 formula, 518 
reflected plane, 436 Work, 137 
refracted plane, 436 expansion, 137 
signal, 478 of force, element of, 52 
sine, 406 function, 620 
sonic, 399 of. ionization, 282 
source, 399 Working medium, 177 
spherical, 404 Wulff-Bragg condition, 476 


standing, 416 

se ar of, 411 

surface, X radiation, 435 
train, 441, 445 characteristic, 572 


length, harmonic, 446 
travelling, 401 ae ae 
vector, 404 powder photograph, 476 
zone, 435 structure analysis, 476 
Wavefront, 402 X-unit, 689 


Wavenumber, 404 
Weak interaction constant, 664 


Weber, 685 i i 

Weight of a body, 122 S poine Sa pag 
Weightlessness, 123 Youle EOR interference 
Wetted perimeter, 213 method, 454 


Wiig ae = one plate, 465 


Wiedmann-Franz law, 269 
Wien, constant, 517 


TO THE READER 


Mir Publishers would be grateful 
for your comments on the content, 
translation and design of this book. 
We would also be pleased to receive 
any other suggestions you may wish 
to make. 

Our address is; 

Mir Publishers 

2 Pervy Rizhsky Pereulok 
I-110, GSP, Moscow, 129820 
USSR 


Printed in the Union of Soviet Socialist Republics 


